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ARTIFICIAL ANTIBODY POLYPEPTIDES 




FIELD OF THE INVENTION 



The present invention relates / generally to the field of the production and 
5 selection of binding and catalytic polypeptides by the methods of molecular biology, 
using both combinatorial chemistry and recombinant DNA. The invention 
specifically relates to the generation of both nucleic acid and polypeptide libraries 
derived therefrom encoding the molecular scaffolding of Fibronectin Type m (Fn3) * 
modified in one or more of its loop regions. The invention also relates to the 
10 "artificial mini-antibodies" or "monobodies " i.e., the polypeptides comprising an 
Fn3 scaffold onto which loop regions capable of binding to a variety of different 
molecular structures (such as antibody binding sites) have been grafted. 

BACKGROUND OF THE INVENTION 

15 Antibody structure 

A standard antibody (Ab) is a tetrameric structure consisting of two identical 
immunoglobulin (Ig) heavy chains and two identical light chains. The heavy and 
light chains of an Ab consist of different domains. Each light chain has one variable 
domain (VL) and one constant domain (CL), while each heavy chain has one 

20 variable domain (VH) and three or four constant domains (CH) (Alzari et al. f 1988). 
Each domain, consisting of - 1 10 amino acid residues, is folded into a characteristic 
P-sandwich structure formed from two p-sheets packed against each other, the 
immunoglobulin fold. The VH and VL domains each have three complementarity 
determining regions (CDR1-3) that are loops, or turns, connecting P-strands at one 

25 end of the domains (Fig. 1 : A, C). The variable regions of both the light and heavy 
chains generally contribute to antigen specificity, although the contribution of the 
individual chains to specificity is not always equal. Antibody molecules have 

1 



evolved to bind to number of molecules by using six Jw^mized loops 

(CDRs). However, the size of the antibodies and the complexity of six loops 
represents a major design hurdle if the end result is to be a relatively small peptide 
ligand. 

5 

Antibody substructures 

Functional substructures of Abs can be prepared by proteolysis and by 
recombinant methods. They include the Fab fragment, which comprises the VH- 
CH1 domains of the heavy chain and the VL-CL1 domains of the light chain joined 

10 by a single interchain disulfide bond, and the Fv fragment, which comprises only the 
VH and VL domains. In some cases, a single VH domain retains significant affinity 
(Ward et al., 1989). It has also been shown that a certain monomeric k light chain 
will specifically bind to its cognate antigen. (L. Masat et al., 1994). Separated light 
or heavy chains have sometimes been found to retain some antigen-binding activity 

15 (Ward et al., 1989). These antibody fragments are not suitable for structural analysis 
using NMR spectroscopy due to their size, low solubility or low conformational 
stability. 

Another functional substructure is a single chain Fv (scFv), comprised of the 
variable regions of the immunoglobulin heavy and light chain, covalently connected 

20 by a peptide linker (S-z Hu et al., 1996). These small (M r 25,000) proteins generally 
retain specificity and affinity for antigen in a single polypeptide and can provide a 
convenient building block for larger, antigen-specific molecules. Several groups 
have reported biodistribution studies in xenografted athymic mice using scFv 
reactive against a variety of tumor antigens, in which specific tumor localization has 

25 been observed. However, the short persistence of scFvs in the circulation limits the 
exposure of tumor cells to the scFvs, placing limits on the level of uptake. As a 
result, tumor uptake by scFvs in animal studies has generally been only l-5%DD/g as 
opposed to intact antibodies that can localize in tumors ad 30-40 %ID/g and have 
reached levels as high as 60-70 %DD/g. 
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A small proteifrlcaffold called a "minibody" was designed using a part of 
the Ig VH domain as the template (Pessi et al., 1993). Minibodies with high affinity 
(dissociation constant (K d ) - 10" 7 M) to interleukin-6 were identified by randomizing 
loops corresponding to CDR1 and CDR2 of VH and then selecting mutants using 

5 the phage display method (Martin et al., 1994). These experiments demonstrated 
that the essence of the Ab function could be transferred to a smaller system. 
However, the minibody had inherited the limited solubility of the VH domain 
(Bianchi et al., 1994). 

It has been reported that camels (Camelus dromedarius) often lack variable 

10 light chain domains when IgG-like material from their serum is analyzed, suggesting 
that sufficient antibody specificity and affinity can be derived form VH domains 
(three CDR loops) alone. Davies and Riechmann recently demonstrated that 
"camelized" VH domains with high affinity (K,, - 10" 7 M) and high specificity can 
be generated by randomizing only the CDR3. To improve the solubility and 

15 suppress nonspecific binding, three mutations were introduced to the framework 
region (Davies & Riechmann, 1995). It has not been definitively shown, however, 
that camelization can be used, in general, to improve the solubility and stability of 
VHs. 

An alternative to the "minibody" is the "diabody." Diabodies are small 
20 bivalent and bispecific antibody fragments, i.e., they have two antigen-binding sites. 
The fragments comprise a heavy-chain variable domain (V H ) connected to a light- 
chain variable domain (VJ on the same polypeptide chain (Vh-VJ. Diabodies are 
similar in size to an Fab fragment. By using a linker that is too short to allow 
pairing between the two domains on the same chain, the domains are forced to pair 
25 with the complementary domains of another chain and create two antigen-binding 
sites. These dimeric antibody fragments, or "diabodies," are bivalent and bispecific. 
P. Holliger et al., PNAS 90:6444-6448 (1993). 

Since the development of the monoclonal antibody technology, a large 
number of 3D structures of Ab fragments in the complexed and/or free states have 
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ly^r^tallography (Webster et al., 1994; 



been solved by X-ray^stallography (Webster et al., 1994; WflUn & Stanfield, 
1994). Analysis of Ab structures has revealed that five out of the six CDRs have 
limited numbers of peptide backbone conformations, thereby permitting one to 
predict the backbone conformation of CDRs using the so-called canonical structures 
5 (Lesk & Tramontane 1992; Rees et al., 1994). The analysis also has revealed that 
the CDR3 of the VH domain (VH-CDR3) usually has the largest contact surface and 
that its conformation is too diverse for canonical structures to be defined; VH-CDR3 
is also known to have a large variation in length (Wu et al., 1993). Therefore, the 
structures of crucial regions of the Ab-antigen interface still need to be 
10 experimentally determined. 

Comparison of crystal structures between the free and complexed states has 
revealed several types of conformational rearrangements. They include side-chain 
p rearrangements, segmental movements, large rearrangements of VH-CDR3 and 

,~ changes in the relative position of the VH and VL domains (Wilson & Stanfield, 

W 15 1993). In the free state, CDRs, in particular those which undergo large 
01 conformational changes upon binding, are expected to be flexible. Since X-ray 

r l crystallography is not suited for characterizing flexible parts of molecules, structural 

* studies in the solution state have not been possible to provide dynamic pictures of 

Q] the conformation of antigen-binding sites. 

u 20 

Mimicking the antibodv-binding site 

CDR peptides and organic CDR mimetics have been made (Dougall et al., 
1994). CDR peptides are short, typically cyclic, peptides which correspond to the 
amino acid sequences of CDR loops of antibodies. CDR loops are responsible for 
25 antibody-antigen interactions. Organic CDR mimetics are peptides corresponding to 
CDR loops which are attached to a scaffold, e.g., a small organic compound. 

CDR peptides and organic CDR mimetics have been shown to retain some 
binding affinity (Smyth & von Itzstein, 1994). However, as expected, they are too 
small and too flexible to maintain full affinity and specificity. Mouse CDRs have 
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been grafted onto the^man Ig framework without the loss of l^mity (Jones et al., 
1986; Riechmann et al., 1988), though this "humanization" does not solve the 
above-mentioned problems specific to solution studies. 



5 Mimicking natural selection p rocesses of Abs 

In the immune system, specific Abs are selected and amplified from a large 
library (affinity maturation). The processes can be reproduced in vitro using 
combinatorial library technologies. The successful display of Ab fragments on the 
surface of bacteriophage has made it possible to generate and screen a vast number 
10 of CDR mutations (McCafferty et ah, 1990; Barbas et al., 1991; Winter etal., 1994). 
An increasing number of Fabs and Fvs (and their derivatives) is produced by this 
O technique, providing a rich source for structural studies. The combinatorial 

technique can be combined with Ab mimics. 

A number of protein domains that could potentially serve as protein scaffolds 
15 have been expressed as fusions with phage capsid proteins. Review in Clackson & 
Wells, Trends Biotechnol. 12:173-184 (1994). Indeed, several of these protein 
domains have already been used as scaffolds for displaying random peptide 
sequences, including bovine pancreatic trypsin inhibitor (Roberts et al., PNAS 
89:2429-2433 (1992)), human growth hormone (Lowman et al., Biochemistry 
20 30: 10832-10838 (1991)), Venturini et al., Protein Peptide Letters 1:70-75 (1994)), 
and the IgG binding domain of Streptococcus (O'Neil et al M Techniques in Protein 
Chemistry V (Crabb, L„ ed.) pp. 517-524, Academic Press, San Diego (1994)). 
These scaffolds have displayed a single randomized loop or region. 

Researchers have used the small 74 amino acid a-amylase inhibitor 
25 Tendamistat as a presentation scaffold on the filamentous phage M13 (McConnell 
and Hoess, 1995). Tendamistat is a p-sheet protein from Streptomyces tendae. It 
has a number of features that make it an attractive scaffold for peptides, including its 
small size, stability, and the availability of high resolution NMR and X-ray 
structural data. Tendamistat' s overall topology is similar to that of an 
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immunoglobulin domain, with two P-sheets connected by a series of loops. In 
contrast to immunoglobulin domains, the p-sheets of Tendamistat are held together 
with two rather than one disulfide bond, accounting for the considerable stability of 
the protein. By analogy with the CDR loops found in immunoglobulins, the loops 
5 the Tendamistat may serve a similar function and can be easily randomized by in 
vitro mutagenesis. 

Tendamistat, however, is derived from Streptomyces tendae. Thus, while 
Tendamistat may be antigenic in humans, its small size may reduce or inhibit its 
antigenicity. Also, Tendamistat' s stability is uncertain. Further, the stability that is 
10 reported for Tendamistat is attributed to the presence of two disulfide bonds. 

Disulfide bonds, however, are a significant disadvantage to such molecules in that 
they can be broken under reducing conditions and must be properly formed in order 
to have a useful protein structure. Further, the size of the loops in Tendamistat are 
relatively small, thus limiting the size of the inserts that can be accommodated in the 
15 scaffold. Moreover, it is well known that forming correct disulfide bonds in newly 
01 synthesized peptides is not straightforward. When a protein is expressed in the 

p cytoplasmic space of E. coli, the most common host bacterium for protein 

? overexpression, disulfide bonds are usually not formed, potentially making it 

S difficult to prepare large quantities of engineered molecules. 

P 20 Thus, there is an on-going need for small, single-chain artificial antibodies 

Q for a variety of therapeutic, diagnostic and catalytic applications. 

O 

SUMMARY OF THE INVENTION 

The invention provides a fibronectin type III (Fn3) polypeptide monobody 
25 comprising a plurality of Fn3 P-strand domain sequences that are linked to a 
plurality of loop region sequences. One or more of the monobody loop region 
sequences of the Fn3 polypeptide vary by deletion, insertion or replacement of at 
least two amino acids from the corresponding loop region sequences in wild-type 
Fn3. The p-strand domains of the monobody have at least about 50% total amino 
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acid sequence homology to the corresponding amino acid sequence of wild-type 
Fn3's p-strand domain sequences. Preferably, one or more of the loop regions of the 
monobody comprise amino acid residues: 

i) from 15 to 16 inclusive in an AB loop; 
5 ii) from 22 to 30 inclusive in a BC loop; 

iii) from 39 to 45 inclusive in a CD loop; 

iv) from 5 1 to 55 inclusive in a DE loop; 

v) from 60 to 66 inclusive in an EF loop; and 

vi) from 76 to 87 inclusive in an FG loop. 

10 The invention also provides a nucleic acid molecule encoding a Fn3 

polypeptide monobody of the invention, as well as an expression vector comprising 
said nucleic acid molecule and a host cell comprising said vector. 

The invention further provides a method of preparing a Fn3 polypeptide 
monobody. The method comprises providing a DNA sequence encoding a plurality 

15 of Fn3 p-strand domain sequences that are linked to a plurality of loop region 
sequences, wherein at least one loop region of said sequence contains a unique 
restriction enzyme site. The DNA sequence is cleaved at the unique restriction site. 
Then a preselected DNA segment is inserted into the restriction site. The 
preselected DNA segment encodes a peptide capable of binding to a specific binding 

20 partner (SBP) or a transition state analog compound (TS AC). The insertion of the 
preselected DNA segment into the DNA sequence yields a DNA molecule which 
encodes a polypeptide monobody having an insertion. The DNA molecule is then 
expressed so as to yield the polypeptide monobody. 

Also provided is a method of preparing a Fn3 polypeptide monobody, which 

25 method comprises providing a replicatable DNA sequence encoding a plurality of 
Fn3 P-strand domain sequences that are linked to a plurality of loop region 
sequences, wherein the nucleotide sequence of at least one loop region is known. 
Polymerase chain reaction (PCR) primers are provided or prepared which are 
sufficiently complementary to the known loop sequence so as to be hybridizable 
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under PCR conditions^herein at least one of the primers contains a modified 
nucleic acid sequence to be inserted into the DNA sequence. PCR is performed 
using the replicatable DNA sequence and the primers. The reaction product of the 
PCR is then expressed so as to yield a polypeptide monobody. 

The invention further provides a method of preparing a Fn3 polypeptide 
monobody. The method comprises providing a replicatable DNA sequence 
encoding a plurality of Fn3 p-strand domain sequences that are linked to a plurality 
of loop region sequences, wherein the nucleotide sequence of at least one loop 
region is known. Site-directed mutagenesis of at least one loop region is performed 
so as to create an insertion mutation. The resultant DNA comprising the insertion 
mutation is then expressed. 

Further provided is a variegated nucleic acid library encoding Fn3 
polypeptide monobodies comprising a plurality of nucleic acid species encoding a 
plurality of Fn3 P-strand domain sequences that are linked to a plurality of loop 
region sequences, wherein one or more of the monobody loop region sequences vary 
by deletion, insertion or replacement of at least two amino acids from corresponding 
loop region sequences in wild-type Fn3, and wherein the p-strand domains of the 
monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequence of p-strand domain sequences of the wild-type 
Fn3. The invention also provides a peptide display library derived from the 
variegated nucleic acid library of the invention. Preferably, the peptide of the 
peptide display library is displayed on the surface of a bacteriophage, e.g., a Ml 3 
bacteriophage or a fd bacteriophage, or virus. 

The invention also provides a method of identifying the amino acid sequence 
of a polypeptide molecule capable of binding to a specific binding partner (SBP) so 
as to form a polypeptide:SSP complex, wherein the dissociation constant of the said 
polypeptide:SBP complex is less than 10* 6 moles/liter. The method comprises the 



steps of: 



a) 



providing a peptide display library of the invention; 
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LC^f the peptide display library of (a) with^^n 



b) contact^ the peptide display library of (a) with an immobilized or 
separable SBP; 

c) separating the peptide:SBP complexes from the free peptides; 

d) causing the replication of the separated peptides of (c) so as to result 
5 in a new peptide display library distinguished from that in (a) by 

having a lowered diversity and by being enriched in displayed 
peptides capable of binding the SBP; 
e ) optionally repeating steps (b), (c), and (d) with the new library of (d); 
and 

10 f) determining the nucleic acid sequence of the region encoding the 

displayed peptide of a species from (d) and hence deducing the 
peptide sequence capable of binding to the SBP. 
The present invention also provides a method of preparing a variegated 
nucleic acid library encoding Fn3 polypeptide monobodies having a plurality of 
15 nucleic acid species each comprising a plurality of loop regions, wherein the species 
encode a plurality of Fn3 0-strand domain sequences that are linked to a plurality of 
loop region sequences, wherein one or more of the loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from corresponding 
loop region sequences in wild-type Fn3, and wherein the p-strand domain sequences 
20 of the monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequences of p-strand domain sequences of the wild-type 
Fn3, comprising the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence; 

25 b) contacting the polypeptide with a specific tending partner (SBP) so 

as to form a polypeptide:SSP complex wherein the dissociation 
constant of the said polypeptide:SBP complex is less than 10 -6 
moles/liter; 
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c) determining the binding structure of the polypeptide:SBP complex by 
nuclear magnetic resonance spectroscopy or X-ray crystallography; 
and 

d) preparing the variegated nucleic acid library, wherein the variegation 
is performed at positions in the nucleic acid sequence which, from 
the information provided in (c), result in one or more polypeptides 
with improved binding to the SBP. 

Also provided is a method of identifying the amino acid sequence of a 
polypeptide molecule capable of catalyzing a chemical reaction with a catalyzed rate 
constant, k^, and an uncatalyzed rate constant, k^, such that the ratio of Kz/Kaczt 
is greater than 10. The method comprises the steps of; 

a) providing a peptide display library of the invention; 

b) contacting the peptide display library of (a) with an immobilized or 
separable transition state analog compound (TS AC) representing the 
approximate molecular transition state of the chemical reaction; 

c) separating the peptide:TS AC complexes from the free peptides; 

d) causing the replication of the separated peptides of (c) so as to result 
in a new peptide display library distinguished from that in (a) by 
having a lowered diversity and by being enriched in displayed 
peptides capable of binding the TS AC; 

e) optionally repeating steps (b), (c), and (d) with the new library of (d); 
and 

0 determining the nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and hence deducing the 
peptide sequence. 

The invention also provides a method of preparing a variegated nucleic acid 
library encoding Fn3 polypeptide monobodies having a plurality of nucleic acid 
species each comprising a plurality of loop regions, wherein the species encode a 
plurality of Fn3 p-strand domain sequences that are linked to a plurality of loop 
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region sequences, wherein one or more of the loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from corresponding 
loop region sequences in wild-type Fn3, and wherein the P-strand domain sequences 
of the monobody have at least a 50% total amino acid sequence homology to the 
corresponding amino acid sequences of p-strand domain sequences of the wild-type 
Fn3, comprising the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence, wherein the polypeptide is capable of catalyzing a 
chemical reaction with a catalyzed rate constant, k^, and an 
uncatalyzed rate constant, k^, such that the ratio of k^/k^ is 
greater than 10; 

b) contacting the polypeptide with an immobilized or separable 
transition state analog compound (TSAC) representing the 
approximate molecular transition state of the chemical reaction; 

c) determining the binding structure of the polypeptide:TSAC complex 
by nuclear magnetic resonance spectroscopy or X-ray 
crystallography; and 

d) preparing the variegated nucleic acid library, wherein the variegation 
is performed at positions in the nucleic acid sequence which, from 
the information provided in (c), result in one or more polypeptides 
with improved binding to or stabilization of the TSAC. 

The invention also provides a kit for the performance of any of the methods 
of the invention. The invention further provides a composition, e.g., a polypeptide, 
prepared by the use of the kit, or identified by any of the methods of the invention. 

The following abbreviations have been used in describing amino acids, 
peptides, or proteins: Ala, or A, Alanine; Arg, or R, Arginine; Asn or N, asparagine; 
Asp, or D, aspartic acid; Cysor C, cystein; Gin, or Q, glutamine; Glu, or E, glutamic 
acid; Gly, or G, glycine; His, or H, histidine; De, or I, isoleucine; Leu, or L, leucine; 
Lys, or K, lysine; Met, or M, methionine; Phe, or F, phenylalanine; Pro, or P, 
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proline; Ser, or S, ser^Prhr, or T, threonine; Tip, or W, trypt^fei; Tyr, or Y, 
tyrosine; Val, or V, valine. 

The following abbreviations have been used in describing nucleic acids, 
DNA, or RNA: A, adenosine; T, thymidine; G, guanosine; C, cytosine. 

BRIEF DESCRIPTION OF THE DRAWINGS 

figure 1. p-Strand and loop topology (A, B) and MOLSCREPT 
representation (C, D; Kraulis, 1991) of the VH domain of anti-lysozyme 
immunoglobulin D1.3 (A, C; Bhat et ah, 1994) and 10th type m domain of human 
fibronectin (B>D; Main et al., 1992). The locations of complementarity determining 
regions (CDRs, Hypervariable regions) and the integrin-binding Arg-Gly-Asp (RGD) 
sequence are indicated. ^ r ' °' 

Figure 2. Amino acid sequence^and restriction sites of the synthetic Fn3 
gene. The residue numbering is according to Main et al. (1992). Restriction 
enzyme sites designed are shown above the amino acid sequence, p -Strands are 
denoted by underlines. The N-terminal "mq" sequence has been added for a 
subsequent cloning into an expression vector. The Hisnag (N^a^n) fusion protein 
has an additional sequence, MGSSHHHHHHSSGLVPRGSH/preceding thrFn3 
sequence shown above. 

jre 3. A, Far UV CD spectra of wild-type Fn3 at 25 °C and 90°C. Fn3 
(50 \iM) wa^dissolved in sodium acetate (50 mM, pH 4.6). B, thermal denaturation 
of Fn3 monitor^ at 215 nm. Temperature was increased at a rate of 1 °C/min. 

*Kteure 4. A, Ca trace of the crystal structure of the complex of lysozyme 
(HEL) andfcte Fv fragment of the anti-hen egg-white lysozyme (anti-HEL) antibody 
DL3 (Bhat et\. 1994). Side chains of the residues 99-102 of VH CDR3, which 
make contact witiriffiL, are also shown. B, Contact surface area for each residue of 
the Dl .3 VH-HEL anchVH-VL interactions plotted vs. residue number of Dl .3 VH. 
Surface area and secondarysStructure were determined using the program DSSP 
(Kabsh and Sander, 1983). CSnd D, schematic drawings of the P-sheet structure of 
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the F strand-loop-G strand moieties of D 1.3 VH (C) and Fn3 (I^^he boxes denote 
residues in P-strands and ovals those not in strands. The shaded boxes indicate 
^ residues of which side chains are significantly buried. The broken lines indicate 

hydroge^onds. <|«tf>*> = 1 * ^ ^'fe^ 

^\ 5 Wure 5. Designed Fn3 gene showing DNA and amino acid sequence^ The 

^\ amino acicN^mbering is according to Main etal. (1992). The two loops that were 

randomized ir\ombinatorial libraries are enclosed in boxes. 

Figure 6. Map of plasmid pAS45. Plasmid pAS45 is the expression vector 
of His # tag-Fn3. 

10 Figure 7. Map of plasmid pAS25. Plasmid pAS25 is the expression vector 

ofFn3. 

„ Figure 8. Map of plasmid pAS38. pAS38 is a phagmid vector for the 

surface display of Fn3. 

Figure 9. (Ubiquitin-1) Characterization of ligand-specific binding of 
15 enriched clones using phage enzyme-linked immunosolvent assay (ELISA). 

Microtiter plate wells were coated with ubiquitin (1 jig/well; "Ligand (+)) and then 
blocked with BSA. Phage solution in TBS containing approximately 10 10 colony 
forming units (cfu) was added to a well and washed with TBS. Bound phages were 
detected with anti-phage antibody-POD conjugate (Pharmacia) with Turbo-TMB 
20 (Pierce) as a substrate. Absorbance was measured using a Molecular Devices 

SPECTRAmax 250 microplate spectrophotometer. For a control, wells without the 
immobilized ligand were used. 2-1 and 2-2 denote enriched clones from Library 2 
eluted with free ligand and acid, respectively. 4-1 and 4-2 denote enriched clones 
from Library 4 eluted with free ligand and acid, respectively. 
25 Figure 10. (Ubiquitin-2) Competition phage ELISA of enriched clones. 

Phage solutions containing approximately 10 10 cfu were first incubated with free 
ubiquitin at 4°C for 1 hour prior to the binding to a ligand-coated well. The wells 
were washed and phages detected as described above. 
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Figure 11. Competition phage ELISA of ubiquitin-binding monobody 411. 
Experimental conditions are the same as described above for ubiquitin. The ELISA 
was performed in the presence of free ubiquitin in the binding solution. The 
experiments were performed with four different preparations of the same^clone^ ^ 'M^ 

5 ( Figure 12. (Fluorescein- 1) Phage ELISA of four clones, pLB25.L pLB25.4 „ /f ^ r 6<^U) 
pLB24.1 ( v s andpLB24.^ fexperimental conditions are the same as ubiquitin- 1 above. ^ ' J 

Figure 13. (Fluorescein-2) Competition ELISA of the four clones. 
Experimental conditions are the same as ubiquitin-2 above. 

Figure 14. l H, l5 N-HSQC spectrum of a fluorescence-binding monobody 
10 LB25.5. Approximately 20 |iM protein was dissolved in 10 mm sodium acetate 

buffer (pH 5.0) containing 100 mM sodium chloride. The spectrum was collected at 
30°C on a Varian Unity INOVA 600 NMR spectrometer. 

figure 15. Characterization of the binding reaction of Ubi4-Fn3 to the 
target, ubRauitin. (a) Phage ELISA analysis of binding of Ubi4-Fn3 to ubiquitin. 
15 The binding\f Ubi4-phages to ubiquitin-coated wells was measured. The control 
experiment waStoerformed with wells containing no ubiquitin. 

(b) Competition phage ELISA of Ubi4-Fn3. Ubi4-Fn3-phages were 
preincubated with soluble ubiquitin at an indicated concentration, followed by the 
phage ELISA detection in ubiquitin-coated wells. 
20 (c) Competition phage ELISA testing the specificity of the Ubi4 clone. The 

Ubi4 phages were preincubated with 250 ng/ml of soluble proteins, followed by 
phage ELISA as in (b). 

(d) EUS A using free proteins. 

Figure 16. Equilibrium unfolding curves for Ubi4-Fn3 (closed symbols) and 
25 wild-type Fn3 (open symbols). Squares indicate data measured in TBS (Tris HC1 
buffer (50 mM, pH 7.5) containing NaCl (150 mM)). Circles indicate data 
measured in Gly HC1 buffer (20 mM, pH 3.3) containing NaCI (300 mM). The 
curves show the best fit of the transition curve based on the two-state model. 
Parameters characterizing the transitions are listed in Table 7. 



14 



H - : 



Ms 



Figure 17. (a^^ l5 N-HSQC spectrum of [ ,5 N]-Ubi4-I^^. (b). Difference 
(6wiid-iypc " ^ub.4) of * H ( b ) and 15n ( c ) chemical shifts plotted versus residue number. 
Values for residues 82-84 (shown as filled circles) where Ubi4-K deletions are set to 
zero. Open circles indicate residues that are mutated in the Ubi4-K protein. The 
5 locations of p-strands are indicated with arrows. 

DETAILED DESCRIPTION OF THE INVENTION 

For the past decade the immune system has been exploited as a rich source 
of de novo catalysts. Catalytic antibodies have been shown to have 

10 chemoselectivity, enantioselectivity, large rate accelerations, and even an ability to 
reroute chemical reactions. In most cases the antibodies have been elicited to 
transition state analog (TS A) haptens. These TSA haptens are stable, low-molecular 
weight compounds designed to mimic the structures of the energetically unstable 
transition state species that briefly (approximate half-life 10* 13 s) appear along 

15 reaction pathways between reactants and products. Anti-TSA antibodies, like 

natural enzymes, are thought to selectively bind and stabilize transition state, thereby 
easing the passage of reactants to products. Thus, upon binding, the antibody lowers 
the energy of the actual transition state and increases the rate of the reaction. These 
catalysts can be programmed to bind to geometrical and electrostatic features of the 

20 transition state so that the reaction route can be controlled by neutralizing 

unfavorable charges, overcoming entropic barriers, and dictating stereoelectronic 
features of the reaction. By this means even reactions that are otherwise highly 
disfavored have been catalyzed (Janda et aL 1997). Further, in many instances 
catalysts have been made for reactions for which there are no known natural or man- 

25 made enzymes. 

The success of any combinatorial chemical system in obtaining a particular 
function depends on the size of the library and the ability to access its members. 
Most often the antibodies that are made in an animal against a hapten that mimics 
the transition state of a reaction are first screened for binding to the hapten and then 
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screened again for catSytic activity. An improved method allows for the direct 
selection for catalysis from antibody libraries in phage, thereby linking chemistry 
and replication. 

A library of antibody fragments can be created on the surface of filamentous 
phage viruses by adding randomized antibody genes to the gene that encodes the 
phage's coat protein. Each phage then expresses and displays multiple copies of a 
single antibody fragment on its surface. Because each phage possesses both the 
surface-displayed antibody fragment and the DNA that encodes that fragment, and 
antibody fragment that binds to a target can be identified by amplifying the 
associated DNA. 

Immunochemists use as antigens materials that have as little chemical 
reactivity as possible. It is almost always the case that one wishes the ultimate 
antibody to interact with native structures. In reactive immunization the concept is 
just the opposite. One immunizes with compounds that are highly reactive so that 
upon binding to the antibody molecule during the induction process, a chemical 
reaction ensues. Later this same chemical reaction becomes part of the mechanism 
of the catalytic event. In a certain sense one is immunizing with a chemical reaction 
rather than a substance per se. Reactive immunogens can be considered as 
analogous to the mechanism-based inhibitors that enzymologists use except that 
they are used in the inverse way in that, instead of inhibiting a mechanism, they 
induce a mechanism. 

Man-made catalytic antibodies have considerable commercial potential in 
many different applications. Catalytic antibody-based products have been used 
successfully in prototype experiments in therapeutic applications, such as prodrug 
activation and cocaine inactivation, and in nontherapeutic applications, such as 
biosensors and organic synthesis. 

Catalytic antibodies are theoretically more attractive than noncatalytic 
antibodies as therapeutic agents because, being catalytic, they may be used in lower 
doses, and also because their effects are unusually irreversible (for example, peptide 
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bond cleavage rather man binding). In therapy, purified catalytic antibodies could 
be directly administered to a patient, or alternatively the patient's own catalytic 
antibody response could be elicited by immunization with an appropriate hapten. 
Catalytic antibodies also could be used as clinical diagnostic tools or as 
5 regioselective or stereoselective catalysts in the synthesis of fine chemicals. 

I. Mutation of Fn3 loops and grafting of Ab loops onto Fn3 

An ideal scaffold for CDR grafting is highly soluble and stable. It is small 
enough for structural analysis, yet large enough to accommodate multiple CDRs so 
10 as to achieve tight binding and/or high specificity. 

A novel strategy to generate an artificial Ab system on the framework of an 
existing non-Ab protein was developed. An advantage of this approach over the 
minimization of an Ab scaffold is that one can avoid inheriting the undesired 
properties of Abs. Fibronectin type IH domain (Fn3) was used as the scaffold. 

15 Fibronectin is a large protein which plays essential roles in the formation of 

t~ ■ 

extracellular matrix and cell-cell interactions; it consists of many repeats of three 
J types (I, II and IE) of small domains (Baron et al., 1991). Fn3 itself is the paradigm 

of a large subfamily (Fn3 family or s-type Ig family) of the immunoglobulin 
superfamily (IgSF). The Fn3 family includes cell adhesion molecules, cell surface 
20 hormone and cytokine receptors, chaperonins, and carbohydrate-binding domains 
(for reviews, see Bork & Doolittle, 1992; Jones, 1993; Bork et al., 1994; Campbell 
& Spitzfaden, 1994; Harpez & Chothia, 1994). 

Recently, crystallographic studies revealed that the structure of the DNA 
binding domains of the transcription factor NF-kB is also closely related to the Fn3 
25 fold (Ghosh et al., 1995; Miiller et al., 1995). These proteins are all involved in 

specific molecular recognition, and in most cases ligand-binding sites are formed by 
surface loops, suggesting that the Fn3 scaffold is an excellent framework for 
building specific binding proteins. The 3D structure of Fn3 has been determined by 
NMR (Main et al., 1992) and by X-ray crystallography (Leahy et al., 1992; 
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Dickinson et al M 1994)7 The structure is best described as a p-sandwich similar to 
that of Ab VH domain except that Fn3 has seven P-strands instead of nine (Fig. 1). 
There are three loops on each end of Fn3; the positions of the BC, DE and FG loops 
approximately correspond to those of CDR1, 2 and 3 of the VH domain, 
5 respectively (Fig. 1 C, D). 

Fn3 is small (~ 95 residues), monomeric, soluble and stable. It is one of few 
members of IgSF that do not have disulfide bonds; VH has an interstrand disulfide 
bond (Fig. 1 A) and has marginal stability under reducing conditions. Fn3 has been 
expressed in E. coli (Aukhil et al., 1993). In addition, 17 Fn3 domains are present 
10 just in human fibronectin, providing important information on conserved residues 
which are often important for the stability and folding (for sequence alignment, see 
Main et al., 1992 and Dickinson et al., 1994). From sequence analysis, large 
variations are seen in the BC and FG loops, suggesting that the loops are not crucial 
to stability. NMR studies have revealed that the FG loop is highly flexible; the 
~^ xx 15 flexibility has been implicated for the spepific binding of the 10th Fn3 to <x 5 pj 

integrin through the Arg-Gly-Asp (RGD)^motif. In the crystal structure of human 
growth hormone-receptor complex (de Vos et al M 1992), the second Fn3 domain of 
the receptor interacts with hormone via the FG and BC loops, suggesting it is 
feasible to build a binding site using the two loops. 
20 The tenth type IE module of fibronectin has a fold similar to that of 

immunoglobulin domains, with seven p strands forming two antiparallel P sheets, 
which pack against each other (Main et al., 1992). The structure of the type EL 
module consists of seven p strands, which form a sandwich of two antiparallel P 
sheets, one containing three strands (ABE) and the other four strands (C'CFG) 
25 (Williams et al., 1988). The triple-stranded P sheet consists of residues Glu-9-Thr- 
14 (A), Ser-17-Asp-23 (B), and Thr-56-Ser-60 (E). The majority of the conserved 
residues contribute to the hydrophobic core, with the invariant hydrophobic residues 
Trp-22 and Try-68 lying toward the N-terminal and C-terminal ends of the core, 
respectively. The p strands are much less flexible and appear to provide a rigid 
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framework upon whictffunctional, flexible loops are built. The topology is similar 
to that of immunoglobulin C domains. 



Gene construction and mutagenesis 

A synthetic gene for tenth Fn3 of human fibronectin (Fig. 2) was designed 
which includes convenient restriction sites for ease of mutagenesis and uses specific 
codons for high-level protein expression (Gribskov et al M 1984). 

The gene was assembled as follows: (1) the gene sequence was divided into 
five parts with boundaries at designed restriction sites (Fig.2); (2) for each part, a 
pair of oligonucleotides that code opposite strands and have complementary 
overlaps of ~ 15 bases was synthesized; (3) the two oligonucleotides were annealed 
and single strand regions were filled in using the Klenow fragment of DNA 
polymerase; (4) the double-stranded oligonucleotide was cloned into the pET3a 
vector (Novagen) using restriction enzyme sites at the termini of the fragment and 
its sequence was confirmed by an Applied Biosystems DNA sequencer using the 
dideoxy termination protocol provided by the manufacturer; (5) steps 2-4 were 
repeated to obtain the whole gene (plasmid pAS25) (Fig. 7). 

Although the present method takes more time to assemble a gene than the 
one-step polymerase chain reaction (PCR) method (Sandhu et al., 1992), no 
mutations occurred in the gene. Mutations would likely have been introduced by the 
low fidelity replication by Taq polymerase and would have required time-consuming 
gene editing. The gene was also cloned into the pET15b (Novagen) vector (pEWl). 
Both vectors expressed the Fn3 gene under the control of bacteriophage T7 promoter 
(Studler et al. 1990); pAS25 expressed the 96-residue Fn3 protein only, while pEWl 
expressed Fn3 as a fusion protein with poly-histidine peptide (His«tag). 
Recombinant DNA manipulations were performed according to Molecular Cloning 
(Sambrook et al., 1989), unless otherwise stated. 

Mutations were introduced to the Fn3 gene using either cassette mutagenesis 
or oligonucleotide site-directed mutagenesis techniques (Deng & Nickoloff, 1992). 
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Cassette mutagenesis^as performed using the same protocol for gene construction 
described above; double-stranded DNA fragment coding a new sequence was cloned 
into an expression vector (pAS25 and/or pEWl). Many mutations can be made by 
combining a newly synthesized strand (coding mutations) and an oligonucleotide 
used for the gene synthesis. The resulting genes were sequenced to confirm that the 
designed mutations and no other mutations were introduced by mutagenesis 
reactions. 



Design and synthesis of Fn3 mutants with antibody CDRs 

Two candidate loops (FG and BC) were identified for grafting. Antibodies 
with known crystal structures were examined in order to identify candidates for the 
sources of loops to be grafted onto Fn3. Anti-hen egg lysozyme (HEL) antibody 
D1.3 (Bhat et al., 1994) was chosen as the source of a CDR loop. The reasons for 
this choice were: (1) high resolution crystal structures of the free and complexed 
states are available (Fig. 4 A; Bhat et al., 1,994), (2) thermodynamics data for the 
binding reaction are available (Tello et al., 1993), (3) D1.3 has been used as a 
paradigm for Ab structural analysis and Ab engineering (Verhoeyen et al., 1988; 
McCafferty et al., 1990) (4) site-directed mutagenesis experiments have shown that 
CDR3 of the heavy chain (VH-CDR3) makes a larger contribution to the affinity 
than the other CDRs (Hawkins et al., 1993), and (5) a binding assay can be easily 
performed. The objective for this trial was to graft VH-CDR3 of D 1.3 onto the Fn3 
scaffold without significant loss of stability. 

An analysis of the D1.3 structure (Fig. 4) revealed that only residues 99-102 
("RDYR") make direct contact with hen egg-white lysozyme (HEL) (Fig. 4 B), 
although VH-CDR3 is defined as longer (Bhat et al., 1994). It should be noted that 
the C-terminal half of VH-CDR3 (residues 101-104) made significant contact with 
the VL domain (Fig. 4 B). It has also become clear that D 1 .3 VH-CDR3 (Fig. 4 C) 
has a shorter turn between the strands F and G than the FG loop of Fn3 (Fig. 4 D). 
Therefore, mutant sequences were designed by using the RDYR (99-102) of D 1.3 as 
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the core and made diflfrent boundaries and loop lengths (TableTX Shorter loops 
may mimic the D1.3 CDR3 conformation better, thereby yielding higher affinity, but 
they may also significantly reduce stability by removing wild-type interactions of 
Fn3. 



Table 1. Amino acid sequences of D 1.3 VH CDR3, VH8 CDR3 and Fn3 FG loop 
and list of planned mutants. 



96 100 



105 



^ 10 



ft 



D1.3 
VH8 

Fn3 



Mutant 
Dl.3-1 
D 1.3-2 
D 1.3-3 
D 1.3-4 
D 1.3-5 
D 1.3-6 
D 1.3-7 
VH8-1 
VH8-2 



ARERDYRLDYWGQG /OrOj^j 
ARGAVVSYYAMJDI1GQG [f(*K 10 AO:"Xj 
7S 80 85 



XA^TGRGDSPASSKPI IB **>:j) 

Sequence \ 

YAERDYRLDY PI f{^***'>°\) 

YAVRDYRLDY PI jfi^-Sj 

YAVRDYRLDYASSKP i/Vtek 1 0 9 

YAVRDYRLDY KPlQ^ i0 ^'"Y 

YAVRDYR SKPI |0 ^ a 

YAVTRDYRL — SSKPI ,S> 
YAVTERDYRL-SSKPIU^' l -A 
YAVAVVS YYAMDY-PI ' D ^1J 
YAVTAVVS YYASSKPI 10 ^-\^ 



Underlines indicate residues in P-strands. Bold 
characters indicate replaced residues. 
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In addition, ananti-HEL single VH domain termed VH8 (Ward et al., 1989) 
was chosen as a template. VH8 was selected by library screening and, in spite of the 
lack of the VL domain, VH8 has an affinity for HEL of 27 nM, probably due to its 
longer VH-CDR3 (Table 1). Therefore, its VH-CDR3 was grafted onto Fn3. 
5 Longer loops may be advantageous on the Fn3 framework because they may provide 
higher affinity and also are close to the loop length of wild-type Fn3. The 3D 
structure of VH8 was not known and thus the VH8 CDR3 sequence was aligned 
with that of D1.3 VH-CDR3; two loops were designed (Table 1). 
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10 Mutant construction and production 

Site-directed mutagenesis experiments were performed to obtain designed 
sequences. Two mutant Fn3s, D 1.3-1 and D 1.3-4 (Table 1) were obtained and both 
were expressed as soluble His*tag fusion proteins. D 1.3-4 was purified and the 
His«tag portion was removed by thrombin cleavage. D 1.3-4 is soluble up to at least 
15 1 mM at pH 7.2. No aggregation of the protein has been observed during sample 
preparation and NMR data acquisition. 

Protein expression and purification 

E. coli BL21 (DE3) (Novagen) were transformed with an expression vector 

20 (pAS25, pEWl and their derivatives) containing a gene for the wild-type or a 

mutant. Cells were grown in M9 minimal medium and M9 medium supplemented 
with Bactotrypton (Difco) containing ampicillin (200 jig/ml). For isotopic labeling, 
l5 N NH 4 CI and/or l3 C glucose replaced unlabeled components. 500 ml medium in a 
2 liter baffle flask were inoculated with 10 ml of overnight culture and agitated at 

25 37°C. Isopropylthio-p-galactoside (HTG) was added at a final concentration of 1 
mM to initiate protein expression when OD (600 nm) reaches one. The cells were 
harvested by centrifugation 3 hours after the addition of IPTG and kept frozen at - 
70°C until used. 
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Fn3 without Hisnag was purified as follows. Cells were suspended in 
5 ml/(&cell) of Tris (50 mM, pH 7.6) containing ethylenediaminetetraacetic acid 
(EDTAA 1 mM) and phenylmethylsulfonyl fluoride (1 mM). HEL was added to a 
final concentration of 0.5 mg/ml. After incubating the solution for 30 minutes at 
37°C, it wks sonicated three times for 30 seconds on ice. Cell debris was removed 
by centrifugation. Ammonium sulfate was added to the solution and precipitate 
recovered by\centrifugation. The pellet was dissolved in 5-10 ml sodium acetate 
(50 mM, pH 46) and insoluble material was removed by centrifugation. The 
solution was applied to a Sephacryl S 100HR column (Pharmacia) equilibrated in the 
10 sodium acetate buffer. Fractions containing Fn3 then was applied to a Resources 
column (Pharmacia) equilibrated in sodium acetate (50 mM, pH 4.6) and eluted with 
a linear gradient offcodium chloride (0-0.5 M). The protocol can be adjusted to 
purify mutant proteins with different surface charge properties. 

Fn3 with HisnVg was purified as follows. The soluble fraction was prepared 
1 5 as described above, exdept that sodium phosphate buffer (50 mM, pH 7.6) 

containing sodium chloride (100 mM) replaced the Tris buffer. The solution was 
applied to a Hi-Trap chelating column (Pharmacia) preloaded with nickel and 
equilibrated in the phosphate buffer. After washing the column with the buffer, 
His»tag-Fn3 was eluted in the\phosphate buffer containing 50 mM EDTA. Fractions 
20 containing His*tag-Fn3 were pooled and applied to a Sephacryl S 100-HR column, 
yielding highly pure protein. ThV His^tag portion was cleaved off by treating the 
fusion protein with thrombin usin&the protocol supplied by Novagen. Fn3 was 
separated from the His*tag peptide a^d thrombin by a Resources column using the 
protocol above. 

25 The wild-type and two mutant proteins so far examined are expressed as 

soluble proteins. In the case that a mutant is expressed as inclusion bodies 
(insoluble aggregate), it is first examined ifc it can be expressed as a soluble protein 
at lower temperature (e.g., 25-30°C). If this\s not possible, the inclusion bodies are 
collected by low-speed centrifugation following cell lysis as described above. The 
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peftet is washed with^^er, sonicated and centrifuged. The in^^ion bodies are 
solubiWd in phosphate buffer (50 mM, pH 7.6) containing guanidinium chloride 
(GdnCI, 6M) and will be loaded on a Hi-Trap chelating column. The protein is 
eluted with theHmffer containing GdnCI and 50 mM EDTA. 

5 

Conformation of mutant Fn3. Dl.3-4 

The ! H NMR spectra of HisHag Dl.3-4 fusion protein closely resembled that 
of the wild-type, suggesting the mutant is folded in a similar conformation to that of 
the wild-type. The spectrum of Dl.3-4 after the removal of the HisHag peptide 

10 showed a large spectral dispersion. A large dispersion of amide protons (7-9.5 ppm) 
and a large number of downfield (5.0-6.5 ppm) C a protons are characteristic of a P- 
sheet protein (Wuthrich, 1986). 

The 2D NOESY spectrum of Dl.3-4 provided further evidence for a 
preserved conformation. The region in the spectrum showed interactions between 

15 upfield methyl protons (< 0.5 ppm) and methyl-methylene protons. The VaI72 y 
methyl resonances were well separated in the wild-type spectrum (-0.07 and 
0.37 ppm; (Baron et ah, 1992)). Resonances corresponding to the two methyl 
protons are present in the Dl.3-4 spectrum (-0.07 and 0.44 ppm). The cross peak 
between these two resonances and other conserved cross peaks indicate that the two 

20 resonances in the D 1 .3-4 spectrum are highly likely those of Val72 and that other 
methyl protons are in nearly identical environment to that of wild-type Fn3. Minor 
differences between the two spectra are presumably due to small structural 
perturbation due to the mutations. Val72 is on the F strand, where it forms a part of 
the central hydrophobic core of Fn3 (Main et al., 1992). It is only four residues 

25 away from the mutated residues of the FG loop (Table 1). The results are 

remarkable because, despite there being 7 mutations and 3 deletions in the loop 
(more than 10% of total residues; Fig. 12, Table 2), Dl.3-4 retains a 3D structure 
virtually identical to that of the wild-type (except for the mutated loop). Therefore, 
the results provide strong support that the FG loop is not significantly contributing 
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aDmtv of the Fn3 molecule and thus that the K3 



to the folding and stabmty of the Fn3 molecule and thus that the K3 loop can be 
mutated extensively. 



Table 2. Sequences of oligonucleotides 



Name Sequence 
FN1F 
FN1R 



Sequence , 1 

CGGG ATCCCAJ^TGC AGGTTTCTG ATGTTCCGCGTG ACCTGG A AGTTGTTGCTGCG ACC • 0 0/0 ' ' V 

t-....x TAA CTGCAGG AGCATCCCAGCTGATCAGCAGGCTAGTCGGGGTCGCAGCAACAAC /<fA I/) 
FN2F CTCCTECAgTTACCGTGCGTTATTACCGTATCACGTACC^ liWO'lSj 
1 0 FN2R G TGAATTCC TXjAACCGGGGAGTTACCACCGGTTTCACCG £ <fti /D 

FN3F AGGAATTCACTGTACCTGGTTC C ^ I 0 "° - 1^) 

FN3R GTATAGTOGACACCCCKJITTCAGGCCGCTC j 0 ^° : 1 

FN4F CGGGTGTCGACTATACCATCACTGTATACGCT ^^|0M>,J£|) 

FN4R CGGGATCCGAGCTCGCTGGGCTGTCAC^ ^ '9 ^ ^ 

15 FN5F CAGCGAGCTCCAAGCCAATCTCGATTAACTACCGT /0<^ > :' l )J 
FN5R C GGGATCCT CGAGTTACTAGGTACGGTAGTTAATCGA ^ $><3, |0 AX>..ZO^ 
FN5R' C GGG ATCC ACGCGTGCCACCGGTACGGTAGTTAATCG A ( ^ JJ) ^°^V . 
gene3F CG GG ATCC ACGCGTCC ATTCGTTTGTG AATATC AAGGC / ^ Q J 

gene3R CCGG AAGCTTT AAGACTCCTTATTACGCAGTATGTTAGC [ ^ I 0 ^° ' 

20 38TAABglII CTGTTACIX3GCCGTGAG ATCTAACCAGCG AGCTC C '° ^* J ^d .ZT?) 

BC3 GATCAGCTCGGATGCTCCTN^ ' ^J'JlS) 

FG2 TGTATACGCTGTTACTCX^^ * ' 

FG3 CTGTATACGCTCTIT^ ^ t^>\^J 

FG4 CATCACTGTATACGCTGTT^ 5<K ) ® ^° - 



Restriction enzyme sites are underlined. N and K denote an equimolar mixture of 
A, T. G and C and that of G and T, respectively. 

30 Structure and stability measurements 

ructures of Abs were analyzed using quantitative methods (e.g., DSSP 
(Kabsch A^ander, 1983) and PDBfit (D. McRee, The Scripps Research Institute)) 
^ W ell as compter graphics (e.g., Quanta (Molecular Simulations) and What if (G. 
Vriend, European Mbi^cular Biology Laboratory)) to superimpose the strand-loop- 
35 strand structures of Abs ahd Fn3. 
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^The stability ot FnAbs was determined by measuring temperature- and 
chemicalMenaturant-induced unfolding reactions (Pace et al., 1989). The 
temperature^induced unfolding reaction was measured using a circular dichroism 
(CD) polarimefcer. Ellipticity at 222 and 215 nm was recorded as the sample 
temperature was slowly raised. Sample concentrations between 10 and 50 were 
used. After the unfokiing baseline was established, the temperature was lowered to 
examine the reversibilfW of the unfolding reaction. Free energy of unfolding was 
determined by fitting datkxo the equation for the two-state transition (Becktel & 
Schellman, 1987; Pace et aK 1989). Nonlinear least-squares fitting was performed 
10 using the program Igor (WaveN^etrics) on a Macintosh computer. 

The structure and stability of two selected mutant Fn3s were studied; the first 
mutant was D 1 .3-4 (Table 2) and the second was a mutant called AS40 which 
2 contains four mutations in the BC loop (A 26 V 27 T 28 V 29 ) - TQRQ). AS40 was 

T s Is randomly chosen from the BC loop library described above. Both mutants were 

2f 15 expressed as soluble proteins in E. coli and were concentrated at least to 1 raM, 
N> permitting NMR studies. 

The mid-point of the thermal denaturation for both mutants was 
approximately 69°C, as compared to approximately 79°C for the wild-type protein. 
The results indicated that the extensive mutations at the two surface loops did not 
20 drastically decrease the stability of Fn3, and thus demonstrated the feasibility of 
^ introducing a large number of mutations in both loops. 

Stability was also determined by guanidinium chloride (GdnCl)- and urea- 
induced unfolding reactions. Preliminary unfolding curves were recorded using a 
fluorometer equipped with a motor-driven syringe; GdnCl or urea were added 
25 continuously to the protein solution in the cuvette. Based on the preliminary 

unfolding curves, separate samples containing varying concentration of a denaturant 
were prepared and fluorescence (excitation at 290 nm, emission at 300-400 nm) or 
CD (ellipticity at 222 and 215 nm) were measured after the samples were 
equilibrated at the measurement temperature for at least one hour. The curve was 
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fitted by the least-sqtu^ method to the equation for the two-state model (Santoro & 
Bolen, 1988; Koide et al., 1993). The change in protein concentration was 
compensated if required. 

\)nce the reversibility of the thermal unfolding reaction is established, the 
unfolding reaction is measured by a Microcal MC-2 differential scanning 
calorimeter (DSC). The cell (~ 1.3 ml) will be filled with FnAb solution (0.1 - 
1 mM) and ACp (= AH/AT) will be recorded as the temperature is slowly raised. T m 
(the midpointW unfolding), AH of unfolding and AG of unfolding is determined by 
fitting the transition curve (Privalov & Potekhin, 1986) with the Origin software 
provided by Micnpcal. 



Thermal unfolding 

A temperature-induced unfolding experiment on Fn3 was performed using 
circular dichroism (CD) spectroscopy to monitor changes in secondary structure. 
The CD spectrum of the native Fn3 shows a weak signal near 222 nm (Fig. 3A), 
consistent with the predominantly p-structure of Fn3 (Perczel et al., 1992). A 
cooperative unfolding transition is observed at 80-90°C, clearly indicating high 
stability of Fn3 (Fig. 3B). The free energy of unfolding could not be determined due 
to the lack of a post-transition baseline. The result is consistent with the high 
stability of the first Fn3 domain of human fibronectin (Litvinovich et ah, 1992), thus 
indicating that Fn3 domains are in general highly stable. 



Binding assays 

Binding reaction of FnAbs were characterized quantitatively using an 
isothermal titration calorimeter (ITC) and fluorescence spectroscopy. 

^Fhe enthalpy change (AH) of binding were measured using a Microcal 
Omega ITC<Wiseman et al., 1989). The sample cell (~ 1.3 ml) was filled with 
FnAbs solution^ 100 jiM, changed according to K,,), and the reference cell filled 
with distilled watehohe system was equilibrated at a given temperature until a stable 
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vaseline is obtained; 5-20 \x\ of Iigand solution 2 mM) was injected by a motor- 
dtWn syringe within a short duration (20 sec) followed by an equilibration delay 
(4 mitautes); the injection was repeated and heat generation/absorption for each 
injectiorWas measured. From the change in the observed heat change as a function 
5 of Iigand concentration, AH and K,, was determined (Wiseman et al., 1989). AG and 
AS of the binding reaction was deduced from the two directly measured parameters. 
Deviation from the theoretical curve was examined to assess nonspecific (multiple- 
site) binding. Experiments were also be performed by placing a Iigand in the cell 
and titrating with an Fn&b. It should be emphasized that only YTC gives direct 
10 measurement of AH, thereby making it possible to evaluate enthalpic and entropic 
contributions to the binding energy. FTC was successfully used to monitor the 
binding reaction of the D1.3 ABSfTello et al., 1993; Bhat et al., 1994). 

Intrinsic fluorescence is monitored to measure binding reactions with K,, in 
the sub-^iM range where the determination of K d by ITC is difficult. Trp 
15 fluorescence (excitation at - 290 nm, emission at 300-350 nm) and Tyr fluorescence 
(excitation at ~ 260 nm, emission at ~ 303 nm) is monitored as the Fn3-mutant 
solution (<, 10 \iM) is titrated with Iigand solution (^ 100 ^iM). K d of the reaction is 
determined by the nonlinear least-squares fitting of the bimolecular binding 
equation. Presence of secondary binding sites is examined using Scatchard analysis. 
20 In all binding assays, control experiments are performed busing wild-type Fn3 (or 
unrelated FnAbs) in place of FnAbs of interest. 

II. Production of Fn3 mutants with high affinity a nd specificity FnAbs 
Library screening was carried out in order to select FnAbs which bind to 
25 specific ligands. This is complementary to the modeling approach described above. 
The advantage of combinatorial screening is that one can easily produce and screen 
a large number of variants (;> 10 8 ), which is not feasible with specific mutagenesis 
("rational design") approaches. The phage display technique (Smith, 1985; O'Neil 
& Hoess, 1995) was used to effect the screening processes. Fn3 was fused to a 
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phage coat protein (puT) and displayed on the surface of filamentous phages. These 
phages harbor a single-stranded DNA genome that contains the gene coding the Fn3 
fusion protein. The amino acid sequence of defined regions of Fn3 were 
randomized using a degenerate nucleotide sequence, thereby constructing a library. 
Phages displaying Fn3 mutants with desired binding capabilities were selected in 
vitro, recovered and amplified. The amino acid sequence of a selected clone can be 
identified readily by sequencing the Fn3 gene of the selected phage. The protocols 
of Smith (Smith & Scott, 1993) were followed with minor modifications. 

The objective was to produce FnAbs which have high affinity to small 
protein ligands. HEL and the B 1 domain of staphylococcal protein G (hereafter 
referred to as protein G) were used as ligands. Protein G is small (56 amino acids) 
and highly stable (Minor & Kim, 1994; Smith et al„ 1994). Its structure was 
determined by NMR spectroscopy (Gronenborn et al. f 1991) to be a helix packed 
against a four-strand p-sheet. The resulting FnAb-protein G complexes (~ 150 
residues) is one of the smallest protein-protein complexes produced to date, well 
within the range of direct NMR methods. The small size, the high stability and 
solubility of both components and the ability to label each with stable isotopes ( l3 C 
and 15 N; see below for protein G) make the complexes an ideal model system for 
NMR studies on protein-protein interactions. 

The successful loop replacement of Fn3 (the mutant D 1.3-4) demonstrate 
that at least ten residues can be mutated without the loss of the global fold. Based 
on this, a library was first constructed in which only residues in the FG loop are 
randomized. After results of loop replacement experiments on the BC loop were 
obtained, mutation sites were extended that include the BC loop and other sites. 

Construction of Fn3 phage display system 

An M13 phage-based expression vector pASMl has been constructed as 
follows: an oligonucleotide coding the signal peptide of OmpT was cloned at the 5' 
end of the Fn3 gene; a gene fragment coding the C-terminal domain of Ml 3 pin was 
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prepared from the wild-type gene m gene of M13 mpl8 using PCR (Corey et al., 
1993) and the fragment was inserted at the 3' end of the OmpT-Fn3 gene; a spacer 
sequence has been inserted between Fn3 and pin. The resultant fragment (OmpT- 
Fn3-pIQ) was cloned in the multiple cloning site of M13 mpl8, where the fusion 
5 gene is under the control of the lac promoter. This system will produce the Fn3-pHI 
fusion protein as well as the wild-type pin protein. The co-expression of wild-type 
pill is expected to reduce the number of fusion pin protein, thereby increasing the 
phage infectivity (Corey et al., 1993) (five copies of pin are present on a phage 
particle). In addition, a smaller number of fusion pin protein may be advantageous 
10 in selecting tight binding proteins, because the chelating effect due to multiple 
binding sites should be smaller than that with all five copies of fusion pm (Bass et 
al., 1990). This system has successfully displayed the serine protease trypsin (Corey 
et al., 1993). Phages were produced and purified using E. coli K91kan (Smith & 
Scott, 1993) according to a standard method (Sambrook et al., 1989) except that 

H 15 phage particles were purified by a second polyethylene glycol precipitation and acid 

si 

y! precipitation. 

V Successful display of Fn3 on fusion phages has been confirmed by ELISA 

* using an Ab against fibronectin (Sigma), clearly indicating that it is feasible to 

|B construct libraries using this system. 

20 An alternative system using the fUSE5 (Parmley & Smith, 1988) may also be 

|5 used. The Fn3 gene is inserted to fUSES using the Sfil restriction sites introduced at 

the 5'- and 3'- ends of the Fn3 gene PCR. This system displays only the fusion pm 
protein (up to five copies) on the surface of a phage. Phages are produced and 
purified as described (Smith & Scott, 1993). This system has been used to display 
25 many proteins and is robust. The advantage of fUSE5 is its low toxicity. This is 
due to the low copy number of the replication form (RF) in the host, which in turn 
makes it difficult to prepare a sufficient amount of RF for library construction 
(Smith & Scott, 1993). 
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Construction of libraries 

The first library was constructed of the Fn3 domain displayed on the surface 
of MB phage in which seven residues (77-83) in the FG loop (Fig. 4D) were 
randomized. Randomization will be achieved by the use of an oligonucleotide 
containing degenerated nucleotide sequence. A double-stranded nucleotide was 
prepared by the same protocol as for gene synthesis (see above) except that one 
strand had an (NNK) 6 (NNG) sequence at the mutation sites, where N corresponds to 
an equimolar mixture of A, T, G and C and K corresponds to an equimolar mixture 
of G and T. The (NNG) codon at residue 83 was required to conserve the Sad 
restriction site (Fig. 2). The (NNK) codon codes all of the 20 amino acids, while the 
NNG codon codes 14. Therefore, this library contained - 10 9 independent 
sequences. The library was constructed by ligating the double-stranded nucleotide 
into the wild-type phage vector, pASMl, and the transfecting E. coli XL1 blue 
(Stratagene) using electroporation. XL1 blue has the lacl q phenotype and thus 
suppresses the expression of the Fn3-pin fusion protein in the absence of lac 
inducers. The initial library was propagated in this way, to avoid selection against 
toxic Fn3-pDI clones. Phages displaying the randomized Fn3-pm fusion protein 
were prepared by propagating phages with K91kan as the host. K91kan does not 
suppress the production of the fusion protein, because it does not have lacP. 
Another library was also generated in which the BC loop (residues 26-20) was 
randomized. 

Selection of displayed FnAbs 

Screening of Fn3 phage libraries was performed using the biopanning 
protocol (Smith & Scott, 1993); a ligand is biotinylated and the strong biotin- 
streptavidin interaction was used to immobilize the ligand on a streptavidin-coated 
dish. Experiments were performed at room temperature (~ 22°C). For the initial 
recovery of phages from a library, 10 fig of a biotinylated ligand were immobilized 
on a streptavidin-coated polystyrene dish (35 mm, Falcon 1008) and then a phage 
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solution (containing -^O 11 pfu (plaque-forming unit)) was added? After washing the 
dish with an appropriate buffer (typically TBST, Tris-HCl (50 mM, pH 7.5), NaCl 
(150 mM) and Tween 20 (0.5%)), bound phages were eluted by one or combinations 
of the following conditions: low pH, an addition of a free ligand, urea (up to 6 M) 

5 and, in the case of anti-protein G FnAbs, cleaving the protein G-biotin linker by 
thrombin. Recovered phages were amplified using the standard protocol using 
K91kan as the host (Sambrook et ah, 1989). The selection process were repeated 3- 
5 times to concentrate positive clones. From the second round on, the amount of the 
ligand were gradually decreased (to - 1 |ig) and the biotinylated ligand were mixed 

10 with a phage solution before transferring a dish (G. P. Smith, personal 

communication). After the final round, 10-20 clones were picked, and their DNA 
sequence will be determined. The ligand affinity of the clones were measured first 
by the phage-ELIS A method (see below). 

To suppress potential binding of the Fn3 framework (background binding) to 

15 a ligand, wild-type Fn3 may be added as a competitor in the buffers. In addition, 
unrelated proteins (e.g., bovine serum albumin, cytochrome c and RNase A) may be 
used as competitors to select highly specific FnAbs. " 



Binding assay 

20 TRhe binding affinity of FnAbs on phage surface is characterized semi- 

quantitative using the phage EUSA technique (Li et aL, 1995). Wells of 
microtiter plates (Nunc) are coated with a ligand protein (or with streptavidin 
followed by thetinding of a biotinylated ligand) and blocked with the Blotto 
solution (Pierce), purified phages (~ 10 10 pfu) originating from single plaques 

25 (M13)/colonies (fUSES) are added to each well and incubated overnight at 4°C. 
After washing wells wit\an appropriate buffer (see above), bound phages are 
detected by the standard EDK5A protocol using anti-M13 Ab (rabbit, Sigma) and 
anti-rabbit Ig-peroxidase conjugate (Pierce) or using anti-M13 Ab-peroxidase 
conjugate (Pharmacia). Colormefcric assays are performed using TMB (3,3 ',5,5'- 
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*tramethylbenzidine^^rce). The high affinity of protein G to immunoglobulins 
pres&nt a special problem; Abs cannot be used in detection. Therefore, to detect 
anti-protein G FnAbs, fusion phages are immobilized in wells and the binding is 
then measuted using biotinylated protein G followed by the detection using 
5 streptavidin-peKQxidase conjugate. 

Production of soluble FnAbs 

After preliminary characterization of mutant Fn3s using phage ELIS A, 
mutant genes are subcloned into the expression vector pEWl. Mutant proteins are 
10 produced as His*tag fusion proteins and purified, and their conformation, stability 
and ligand affinity are characterized. 

Thus, Fn3 is the fourth example of a monomeric immunoglobulin-like 

Cm scaffold that can be used for engineering binding proteins. Successful selection of 

W 

si 15 novel binding proteins have also been based on minibody, tendamistat and 

"camelized" immunoglobulin VH domain scaffolds (Martin et al., 1994; Davies & 
Riechmann, 1995; McConnell & Hoess, 1995). The Fn3 scaffold has advantages 
over these systems. Bianchi et al. reported that the stability of a minibody was 2.5 
kcal/mol, significantly lower than that of Ubi4-K. No detailed structural 
20 characterization of minibodies has been reported to date. Tendamistat and the VH 
domain contain disulfide bonds, and thus preparation of correctly folded proteins 
may be difficult. Davies and Riechmann reported that the yields of their camelized 
VH domains were less than 1 mg per liter culture (Davies & Riechmann, 1996). 
Thus, the Fn3 framework can be used as a scaffold for molecular 
25 recognition. Its small size, stability and well-characterized structure make Fn3 an 
attractive system. In light of the ubiquitous presence of Fn3 in a wide variety of 
natural proteins involved in ligand binding, one can engineer Fn3-based binding 
proteins to different classes of targets. 

The following examples are intended to illustrate but not limit the invention. 
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EXAMPLE I 



Construction of the Fn3 gene 
A synthetic gene for tenth Fn3 of fibronectin (Fig.l) was designed on the 
basis of amino acid residue 1416-1509 of human fibronectin (Kornblihtt, et a/., 

5 1985) and its three dimensional structure (Main, et al % 1992). The gene was 

engineered to include convenient restriction sites for mutagenesis and the so-called 
"preferred codons" for high level protein expression (Gribskov, et ai 9 1984) were 
used. In addition, a glutamine residue was inserted after the N-terminal methionine 
in order to avoid partial processing of the N-terminal methionine which often 

10 degrades NMR spectra (Smith, et aU 1994). Chemical reagents were of the 

analytical grade or better and purchased from Sigma Chemical Company and J.T. 
Baker, unless otherwise noted. Recombinant DNA procedures were performed as 
described in "Molecular Cloning" (Sambrook, et al, 1989), unless otherwise stated. 
Custom oligonucleotides were purchased from Operon Technologies. Restriction 

15 and modification enzymes were from New England Biolabs. 

^thc gene was assembled in the following manner. First, the gene sequence 
(Fig. 5) was divided into five parts with boundaries at designed restriction sites: 
fragment 1, Ndel-PstI (oligonucleotides FN IF and FN1R (Table 2); fragment 2, 
Pstl-EcoRI (fW and FN2R); fragment 3, EcoRI-Sall (FN3F and FN3R); fragment 

20 4, Sall-SacI (FN4F and FN4R); fragment 5, SacI-BamHI (FN5F and FN5R). 

Second, for each part, a pair of oligonucleotides which code opposite strands and 
have comp!ementaiy\yerlaps of approximately 15 bases was synthesized. These 
oligonucleotides were designated FN1F-FN5R and are shown in Table 2. Third, 
each pair (e.g., FN1F and F\N1R) was annealed and single-strand regions were filled 

25 in using the Klenow fragment of DNA polymerase. Fourth, the double stranded 
oligonucleotide was digested wkh the relevant restriction enzymes at the termini of 
the fragment and cloned into the pBlueScript SK plasmid (Stratagene) which had 
been digested with the same enzymes as those used for the fragments. The DNA 
sequence of the inserted fragment was\onfirmed by DNA sequencing using an 
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Appiifed Biosys terns DNA sequencer and the dideoxy termination protocol provided 
by the mzmhfocturer. Last, steps 2-4 were repeated to obtain the entire gene. 

The gene was also cloned into the pET3a and pET15b (Novagen) vectors 
(pAS45 and pAS25, respectively). The maps of the plasmids are shown in Figs. 6 
5 and 7. E. coli BL21 (DE3) (Novagen) containing these vectors expressed the Fn3 
gene under the control of bacteriophage T7 promotor (Studier, et al, 1990); pAS24 
expresses the 96-residue Fn3 protein only, while pAS45 expresses Fn3 as a fusion 
protein with poly-histidine peptide (Hisnag). High level expression of the Fn3 
protein and its derivatives in E. coli was detected as an intense band on SDS-PAGE 
10 stained with CBB. 

The binding reaction of the monobodies is characterized quantitatively by 
means of fluorescence spectroscopy using purified soluble monobodies. 

Intrinsic fluorescence is monitored to measure binding reactions. Trp 
fluorescence (excitation at -290 nm, emission at 300 350 nm) and Tyr fluorescence 
15 (excitation at -260 nm, emission at -303 nm) is monitored as the Fn3-mutant 

solution (< 100 ^iM) is titrated with a ligand solution. When a ligand is fluorescent 
(e.g. fluorescein), fluorescence from the ligand may be used. K d of the reaction will 
be determined by the nonlinear least-squares fitting of the bimolecular binding 
equation. 

20 If intrinsic fluorescence cannot be used to monitor the binding reaction, 

monobodies are labeled with fluorescein-NHS (Pierce) and fluorescence 
polarization is used to monitor the binding reaction (Burke et al y 1996). 

EXAMPLE II 

25 Modifications to include restriction sites in the Fn3 gene 

The restriction sites were incorporated in the synthetic Fn3 gene without 
changing the amino acid sequence Fn3. The positions of the restriction sites were 
chosen so that the gene construction could be completed without synthesizing long 
(>60 bases) oligonucleotides and so that two loop regions could be mutated 
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TLizationl bv the cassette mutagenesis methoa (ij 



(including by randomization) by the cassette mutagenesis methoa (i.e., swapping a 
fragment with another synthetic fragment containing mutations). In addition, the 
restriction sites were chosen so that most sites were unique in the vector for phage 
display. Unique restriction sites allow one to recombine monobody clones which 
5 have been already selected in order to supply a larger sequence space. 

EXAMPLE III 
Construction of M13 phage display libraries 

A vector for phage display, pAS38 (for its map, see Fig. 8) was constructed 
10 as follows. The Xbal-BamHI fragment of pET12a encoding the signal peptide of 
OmpT was cloned at the 5* end of the Fn3 gene. The C-terminal region (from the 
FN5F and FN5R oligonucleotides, see Table 2) of the Fn3 gene was replaced with a 
new fragment consisting of the FN5F and FN5R* oligonucleotides (Table 2) which 
Ci introduced a Mlul site and a linker sequence for making a fusion protein with the 

1 5 pIU protein of bacteriophage M 1 3. A gene fragment coding the C-terminal domain 
of M13 pDI was prepared from the wild-type gene EI of M13mpl8 using PCR 
(Corey, et ai t 1993) and the fragment was inserted at the 3' end of the OmpT-Fn3 
fusion gene using the Mlul and Hindlll sites. 

Phages were produced and purified using a helper phage, M13K07, 
20 according to a standard method (Sambrook, et a/., 1989) except that phage particles 
were purified by a second polyethylene glycol precipitation. Successful display of 
Fn3 on fusion phages was confirmed by EUSA (Harlow & Lane, 1988) using an 
antibody against fibronectin (Sigma) and a custom anti-FN3 antibody (Cocalico 
Biologicals, PA, USA). 

25 
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EXAMPLE IV 
Libraries containing loop variegations in the AB loop 

A nucleic acid phage display library having variegation in the AB loop is 
prepared by the following methods. Randomization is achieved by the use of 

5 oligonucleotides containing degenerated nucleotide sequence. Residues to be 
variegated are identified by examining the X-ray and NMR structures of Fn3 
(Protein Data Bank accession numbers, 1FNA and 1TTF, respectively). 
Oligonucleotides containing NNK (N and K here denote an equimolar mixture of A, 
T, G, and C and an equimolar mixture of G and T, respectively) for the variegated 

10 residues are synthesized (see oligonucleotides BC3, FG2 f FG3, and FG4 in Table 2 
for example). The NNK mixture codes for all twenty amino acids and one 
termination codon (TAG). TAG, however, is suppressed in the £. coli XL-1 blue. 
Single-stranded DNAs of pAS38 (and its derivatives) are prepared using a standard 
protocol (Sambrook, etai, 1989). 

15 ^te-directed mutagenesis is performed following published methods (see for 

example, \unkel, 1985) using a Muta-Gene kit (BioRad). The libraries are 
constructed bv electroporation of £. coli XL-1 Blue electroporation competent cells 
(200 pi; Stratagtoie) with lpg of the plasmid DNA using a BTX electrocell 
manipulator ECN^395 1mm gap cuvette. A portion of the transformed cells is 

20 plated on an LB-aga^pIate containing ampicillin (100 pg/ml) to determine the 

transformation efficiency. Typically, 3 X 10 8 transformants are obtained with 1 \ig 
of DNA, and thus a HbraX contains 10 8 to 10 9 independent clones. Phagemid 
particles were prepared as described above. 



25 EXAMPLE V 

Loop variegations in the BC, CD, DE, EF or FG loop 

A nucleic acid phage display library having five variegated residues (residues 
number 26-30) in the BC loop, and one having seven variegated residues (residue 
numbers 78-84) in the FG loop, was prepared using the methods described in 
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Example IV above. Orar nucleic acid phage display libraries raving variegation in 
the CD, DE or EF loop can be prepared by similar methods. 



EXAMPLE VI 
Loop variegations in the FG and BC loop 

A nucleic acid phage display library having seven variegated residues 
(residues number 78-84) in the FG loop and five variegated residues (residue 
number 26-30) in the BC loop was prepared. Variegations in the BC loop were 
prepared by site-directed mutagenesis (Kunkel, et al) using the BC3 oligonucleotide 
described in Table 1. Variegations in the FG loop were introduced using site- 
directed mutagenesis using the BC loop library as the starting material, thereby 
resulting in libraries containing variegations in both BC and FG loops. The 
oligonucleotide FG2 has variegating residues 78-84 and oligonucleotide FG4 has 
variegating residues 77-81 and a deletion of residues 82-84. 

A nucleic acid phage display library having five variegated residues (residues 
78-84) in the FG loop and a three residue deletion (residues 82-84) in the FG loop, 
and five variegated residues (residues 26-30) in the BC loop, was prepared. The 
shorter FG loop was made in an attempt to reduce the flexibility of the FG loop; the 
loop was shown to be highly flexible in Fn3 by the NMR studies of Main, et al 
(1992). A highly flexible loop may be disadvantageous to forming a binding site 
with a high affinity (a large entropy loss is expected upon the ligand binding, 
because the flexible loop should become more rigid). In addition, other Fn3 
domains (besides human) have shorter FG loops (for sequence alignment, see Figure 
12 in Dickinson, etal (1994)). 

Randomization was achieved by the use of oligonucleotides containing 
degenerate nucleotide sequence (oligonucleotide BC3 for variegating the BC loop 
and oligonucleotides FG2 and FG4 for variegating the FG loops). 

Site-directed mutagenesis was performed following published methods (see 
for example, Kunkel, 1985). The libraries were constructed by electrotransforming 
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It^^ene). Typically a library contains 10 
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E. co/* XL-1 Blue (StrSSgene). Typically a library contains 10 8 to 10 9 independent 
clones. Library 2 contains five variegated residues in the BC loop and seven 
variegated residues in the FG loop. Library 4 contains five variegated residues in 
each of the BC and FG loops, and the length of the FG loop was shortened by three 
5 residues. 

EXAMPLE VII 
fd phage display libraries constructed with loop variegations 
Phage display libraries are constructed using the fd phage as the genetic 
10 vector. The Fn3 gene is inserted in fUSES (Parmley & Smith, 1988) using Sfil 
restriction sites which are introduced at the 5' and 3' ends of the Fn3 gene using 
PCR. The expression of this phage results in the display of the fusion pm protein 
on the surface of the fd phage. Variegations in the Fn3 loops are introduced using 
site-directed mutagenesis as described hereinabove, or by subcloning the Fn3 
1 5 libraries constructed in Ml 3 phage into the fUSES vector. 

EXAMPLE VIII 
Other phage display libraries 

T7 phage libraries (Novagen, Madison, WI) and bacterial pili expression 
20 systems (Invitrogen) are also useful to express the Fn3 gene. 

EXAMPLE IX 

Isolation of polypeptides which bind to macromolecular structures 

The selection of phage-displayed monobodies was performed following the 
25 protocols of Barbas and coworkers (Rosenblum & Barbas, 1995). Briefly, 

approximately 1 |ig of a target molecule ("antigen") in sodium carbonate buffer (100 
mM, pH 8.5) was immobilized in the wells of a microtiter plate (Maxisorp, Nunc) 
by incubating overnight at 4°C in an air tight container. After the removal of this 
solution, the wells were then blocked with a 3% solution of BS A (Sigma, Fraction 
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V) in TBS by incubating the plate at 37 °C for 1 hour. A phagemid library solution 
(50 fil) containing approximately 10 12 colony forming units (cfu) of phagemid was 
absorbed in each well at 37 °C for 1 hour. The wells were then washed with an 
appropriate buffer (typically TBST, 50 mM Tris-HCl (pH 7.5), 150 mM NaCl, and 
0.5% Tween20) three times (once for the first round). Bound phage were eluted by 
an acidic solution (typically, 0.1 M glycine-HCl, pH 2.2; 50 ^1) and recovered phage 
were immediately neutralized with 3 fil of Tris solution. Alternatively, bound phage 
were eluted by incubating the wells with 50 \xl of TBS containing the antigen (1 - 
10 jiM). Recovered phage were amplified using the standard protocol employing 
the XLlBlue cells as the host (Sambrook, et ai). The selection process was 
repeated 5-6 times to concentrate positive clones. After the final round, individual 
clones were picked and their binding affinities and DNA sequences were 
determined. 

The binding affinities of monobodies on the phage surface were 
characterized using the phage ELISA technique (Li, et al % 1995). Wells of 
microtiter plates (Nunc) were coated with an antigen and blocked with BSA. 
Purified phages (10 8 - 10 u cfu) originating from a single colony were added to each 
well and incubated 2 hours at 37 °C. After washing wells with an appropriate buffer 
(see above), bound phage were detected by the standard ELISA protocol using anti- 
M13 antibody (rabbit, Sigma) and anti-rabbit Ig-peroxidase conjugate (Pierce). 
Colorimetric assays were performed using Turbo-TMB (3^,5^ 
tetramethylbenzidine, Pierce) as a substrate. 

The binding affinities of monobodies on the phage surface were further 
characterized using the competition ELISA method (Djavadi-Ohaniance, et al., 
1996). In this experiment, phage ELISA is performed in the same manner as 
described above, except that the phage solution contains a ligand at varied 
concentrations. The phage solution was incubated a 4°C for one hour prior to the 
binding of an immobilized ligand in a microtiter plate well. The affinities of phage 
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displayed monobodies are estimated by the decrease in ELIS A signal as the free 
ligand concentration is increased. 

After preliminary characterization of monobodies displayed on the surface of 
phage using phage ELISA, genes for positive clones were subcloned into the 
5 expression vector pAS45. E. coli BL21(DE3) (Novagen) was transformed with an 
expression vector (pAS45 and its derivatives). Cells were grown in M9 minimal 
medium and M9 medium supplemented with Bactotryptone (Difco) containing 
ampicillin (200 ^g/ml). For isotopic labeling, l5 N NH 4 C1 and/or l3 C glucose 
replaced unlabeled components. Stable isotopes were purchased from Isotec and 
10 Cambridge Isotope Labs. 500 ml medium in a 2 1 baffle flask was inoculated with 
10 ml of overnight culture and agitated at approximately 140 rpm at 37 °C. IPTG 
was added at a final concentration of 1 mM to induce protein expression when 
OD(600 nm) reached approximately 1.0. The cells were harvested by centrifugation 
3 hours after the addition of IPTG and kept frozen at -70°C until used. 
15 \n3 and monobodies with His^tag were purified as follows. Cells were 

suspendeXin 5 ml/(g cell) of 50 mM Tris (pH 7.6) containing 1 mM 
phenylmethWulfonyl fluoride. HEL (Sigma, 3X crystallized) was added to a final 
p concentration Of 0.5 mg/ml. After incubating the solution for 30 min at 37 °C, it was 

sonicated so as tos^ause cell breakage three times for 30 seconds on ice. Cell debris 
\ 20 was removed by cehtrifugation at 15,000 rpm in an Sorval RC-2B centrifuge using 

an SS-34 rotor. Concentrated sodium chloride is added to the solution to a final 
concentration of 0.5 M.Vhe solution was then applied to a 1 ml HisTrap™ 
chelating column (Pharmafcia) preloaded with nickel chloride (0.1 M, 1 ml) and 
equilibrated in the Tris buffer (50 mM, pH 8.0) containing 0.5 M sodium chloride. 
25 After washing the column witttUhe buffer, the bound protein was eluted with a Tris 
buffer (50 mM, pH 8.0) containing 0.5 M imidazole. The HisHag portion was 
cleaved off, when required, by treating the fusion protein with thrombin using the 
protocol supplied by Novagen (Madison, WI). Fn3 was separated from the His^tag 
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n^^i Resources®column (Pharmacia) usi^^li 



peptide and^hrombin^pi Resources®column (Pharmacia) usin^Tlinear gradient of 
sodium chloridb^O - 0.5 M) in sodium acetate buffer (20 mM, pH 5.0). 

Small amounts of soluble monobodies were prepared as follows. XL-1 Blue 
cells containing pAS38 derivatives (plasmids coding Fn3-pm fusion proteins) were 
5 grown in LB media at 37 °C with vigorous shaking until OD(600 nm) reached 
approximately 1.0; IPTG was added to the culture to a final concentration of 1 mM, 
and the cells were further grown overnight at 37°C. Cells were removed from the 
medium by centrifugation, and the supernatant was applied to a microtiter well 
coated with a ligand. Although XL-1 Blue cells containing pAS38 and its 
10 derivatives express FN3-pIII fusion proteins, soluble proteins are also produced due 
to the cleavage of the linker between the Fn3 and pin regions by proteolytic 
activities of E. coli (Rosenblum & Barbas, 1995). Binding of a monobody to the 
ligand was examined by the standard ELISA protocol using a custom antibody 
against Fn3 (purchased from Cocalico Biologicals, Reamstown, PA). Soluble 
^ 15 monobodies obtained from the periplasmic fraction of E. coli cells using a standard 
fit osmotic shock method were also used. 



EXAMPLE X 
Ubiquitin binding monobody 

20 Ubiquitin is a small (76 residue) protein involved in the degradation pathway 

in eurkaryotes. It is a single domain globular protein. Yeast ubiquitin was 
purchased from Sigma Chemical Company and was used without further 
purification. 

Libraries 2 and 4, described in Example VI above, were used to select 
25 ubiquitin-binding monobodies. Ubiquitin (1 |ig in 50 fil sodium bicarbonate buffer 
(100 mM, pH 8.5)) was immobilized in the wells of a microtiter plate, followed by 
blocking with BS A (3% in TBS). Panning was performed as described above. In 
the first two rounds, 1 \xg of ubiquitin was immobilized per well, and bound phage 
were elute with an acidic solution. From the third to the sixth rounds, 0.1 \xg of 
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ubiquitin was immobilized per well and the phage were eluted either with an acidic 
solution or with TBS containing 10 uM ubiquitin. 

Binding of selected clones was tested first in the polyclonal mode, i.e., 
before isolating individual clones. Selected clones from all libraries showed 
5 significant binding to ubiquitin. These results are shown in Figure 9. The binding 
to the immobilized ubiquitin of the clones was inhibited almost completely by less 
than 30 uM soluble ubiquitin in the competition ELISA experiments (see Fig. 10). 
The sequences of the BC and FG loops of ubiquitin-binding monobodies is shown in 
Table 3. 

10 

Table 3. Sequences of ubiquitin-binding monobodies 



Occurrence (if 

Name BC loop FG loop more than one) 

2 1 1 CARRA w wo '-^RWIPLAK & 2 

15 212 CWRRA RWVGLAW 

213 CKHRR FADLWWR 

214 CRRGR RGFMWLS 

215 CNWRR RAYRYRW 

411 SRLRR PPWRV 9 

20 422 ARWTL RRWWW 

424 GQRTF RRWWA 



The 41 1 clone, which was the most enriched clone, was characterized using 
phage ELISA. The 411 clone showed selective binding and inhibition of binding in 
25 the presence of about 10 uM ubiquitin in solution (Fig. 1 1 ). 



EXAMPLE XI 
Methods for the immobilization of small molecules 

TargeNnolecules were immobilized in wells of a microtiter plate (Maxisorp, 
30 Nunc) as describedhereinbelow, and the wells were blocked with BSA. In addition 
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to the usedi.carrier proem as described below, a conjugate of target molecule in 
biotin can be mide. The biotinylated ligand can then be immobilized to a microtiter 
plate well which ha^heen coated with streptavidin. 

In addition to the use of a carrier protein as described below, one could make 
a conjugate of a target molecule and biotin (Pierce) and immobilize a biotinylated 
ligand to a microtiter plate well which has been coated with streptavidin (Smith and 
Scott, 1993). 

Small molecules may be conjugated with a carrier protein such as bovine 
serum albumin (BSA, Sigma), and passively adsorbed to the microtiter plate well. 
Alternatively, methods of chemical conjugation can also be used. In addition, solid 
supports other than microtiter plates can readily be employed. 

EXAMPLE XII 
Fluorescein binding monobody 

Fluorescein has been used as a target for the selection of antibodies from 
combinatorial libraries (Barbas, et al 1992). NHS-fluorescein was obtained from 
Pierce and used according to the manufacturer's instructions in preparing conjugates 
with BSA (Sigma). Two types of fluorescein-BS A conjugates were prepared with 
approximate molar ratios of 17 (fluorescein) to one (BSA). 

The selection process was repeated 5-6 times to concentrate positive clones. 
In this experiment, the phage library was incubated with a protein mixture (BSA, 
cytochrome C (Sigma, Horse) and RNaseA (Sigma, Bovine), 1 mg/ml each) at room 
temperature for 30 minutes, prior to the addition to ligand coated wells. Bound 
phage were eluted in TBS containing 10 \xM soluble fluorescein, instead of acid 
elution. After the final round, individual clones were picked and their binding 
affinities (see below) and DNA sequences were determined. 
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able 4. Clones from Library #2 

BC FG 
WT AVTVR RGDSPAS 



pLB24.1 


CNWRR 


RAYRYRW 


pLB24.2 


CMWRA 


RWGMLRR 


pLB24.3 


ARMRE 


RWLRGRY 


pLB24.4 


CARRR 


RRAGWGW 


_T TJO/l C 

pLt>Z4.D 




RAYRYRW 


pLB24.6 


RWRER 


RHPWTER 


pLB24.7 


CNWRR 


RAYRYRW 


pLB24.8 


ERR VP 


RLLLWQR 


pLB24.9 


GRGAG 


FGSFERR 


pLB24.11 


CRWTR 


RRWFDGA 


pLB24.12 


CNWRR 


RAYRYRW 




Clones from Library #4 


WT 


AVTVR 


GRGDS 



pLB25.1 

pLB25.2 

pLB25.3 

pLB25.4 

pLB25.5 

pLB25.6 

pLB25.7 

pLB25.9 

pLB25.11 

pLB25.12 



GQRTF 
GQRTF 
GQRTF 
LRYRS 
GQRTF 
GQRTF 
LRYRS 
LRYRS 
GQRTF 
LRYRS 



RRWWA 
RRWWA 
RRWWA 
GWRWR 
RRWWA 
RRWWA 
GWRWR 
GWRWR 
RRWWA 
GWRWR 



45 



:flfcterization of the binding affinities of^fcct 



Preliminary cnSlcterization of the binding affinities of^ected clones were 
performed using phage ELISA and competition phage ELISA (see Fig. 12 
(Fluorescein- 1) and Fig. 13 (Fluorescein-2)). The four clones tested showed specific 
binding to the ligand-coated wells, and the binding reactions are inhibited by soluble 
5 fluorescein (see Fig. 13). 



EXAMPLE XIII 
Digoxigenin binding monobody 

Digoxigenin-3-O-methyl-carbonyl-e-aminocapronicacid-NHS (Boehringer 
10 Mannheim) is used to prepare a digoxigenin-BSA conjugate. The coupling reaction 
is performed following the manufacturers 1 instructions. The digoxigenin-BSA 
conjugate is immobilized in the wells of a microtiter plate and used for panning. 
Panning is repeated 5 to 6 times to enrich binding clones. Because digoxigenin is 
sparingly soluble in aqueous solution, bound phages are eluted from the well using 
15 acidic solution. See Example XIV. 



EXAMPLE XIV 
TSAC (transition state analog compound) binding monobodies 
Carbonate hydrolyzing monobodies are selected as follows. A transition 
20 state analog for carbonate hydrolysis, 4-nitrophenyl phosphonate is synthesized by 
an Arbuzov reaction as described previously (Jacobs and Schultz, 1987). The 
phosphonate is then coupled to the carrier protein, BS A, using carbodiimide, 
followed by exhaustive dialysis (Jacobs and Schultz, 1987). The hapten-BSA 
conjugate is immobilized in the wells of a microtiter plate and monobody selection 
25 is performed as described above. Catalytic activities of selected monobodies are 
tested using 4-nitrophenyl carbonate as the substrate. 

Other haptens useful to produce catalytic monobodies are summarized in H. 
Suzuki (1994) and in N. R. Thomas (1994). 
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EXAMPLE XV 
NMR characterization of Fn3 and comparison of the Fn3 
secreted by yeast with that secreted by E. coli 
Nuclear magnetic resonance (NMR) experiments are performed to identify 
5 the contact surface between FnAb and a target molecule, e.g., monobodies to 
fluorescein, ubiquitin, RNaseA and soluble derivatives of digoxigenin. The 
information is then be used to improve the affinity and specificity of the monobody. 
Purified monobody samples are dissolved in an appropriate buffer for NMR 
spectroscopy using Amicon ultrafiltration cell with a YM-3 membrane. Buffers are 
10 made with 90 % H 2 O/10 % D 2 0 (distilled grade, Isotec) or with 100 % D 2 0. 

Deuterated compounds (e.g. acetate) are used to eliminate strong signals from them. 

R experiments are performed on a Varian Unity INOVA 600 
spectrometer equipped with four RF channels and a triple resonance probe with 
pulsed field gWient capability. NMR spectra are analyzed using processing 
15 programs such akFelix (Molecular Simulations), nmrPipe, PIPP, and CAPP 
(Garrett, et a/., 1994 ; Delaglio, et aL, 1995) on UNIX workstations. Sequence 
specific resonance alignments are made using well-established strategy using a set 
of triple resonance experiments (CBCA(CO)NH and HNCACB) (Grzesiek & Bax, 
1992; Wittenkind & Mueljer, 1993). 

Nuclear Overhauser effect (NOE) is observed between 'H nuclei closer than 
approximately 5 A, which allows one to obtain information on interproton distances. 
A series of double- and triple-resonance experiments (Table 5; for recent reviews on 
these techniques, see Bax & Grzesiek, 1993 and Kay, 1995) are performed to collect 
distance (i.e. NOE) and dihedral angle (J-coupling) constraints. Isotope-filtered 
25 experiments are performed to determine resonance assignments of the bound ligand 
and to obtain distance constraints within the ligand and those between FnAb and the 
ligand. Details of sequence specific resonance assignments and NOE peak 
assignments have been described in detail elsewhere (Clore & Gronenborn, 1991; 
Pascal, etai, 1994b; Metzler, etal, 1996). 



20 
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Table 5. NMR experi 



for structure characterization 



Experiment Name 
1. reference spectra 



Reference 



2D-'H, l5 N-HSQC 
2D-'H, l3 C-HSQC 



(Bodenhausen & Ruben, 1980; Kay, et al, 1992) 
(Bodenhausen & Ruben, 1980; Vuister & Bax, 1992) 



2. backbone and side chain resonance assignments of l3 C/ l5 N-labeled protein 



m 

Id 



CO 



5=1 



10 



3D-CBCA(CO)NH 

3D-HNCACB 

3D-C(CO)NH 

3D-H(CCO)NH 
15 3D-HBHA(CBCACO)NH 

3D-HCCH-TOCSY 

3D-HCCH-COSY 

3D-'H, ,5 N-TOCSY-HSQC 

2D-HB (CBCDCE)HE 
20 

3. resonance assignments of unlabeled ligand 



2D-isotope-filtered 'H-TOCSY 
2D-isotope-filtered 'H-COSY 
25 2D-isotope-fiItered 'H-NOESY 



(Grzesiek & Bax, 1992) 
(Wittenkind & Mueller, 1993) 
(Logan et al, 1992; Grzesiek et ai, 1993) 



(Grzesiek & Bax, 1993) 
(Kay etal., 1993) 
(Bcuraefa/., 1991) 
(Zhang etal, 1994) 
(Yamazaki etal., 1993) 



(Ikura & Bax, 1992) 



4. structural constraints 
within labeled protein 

3D-'H, 15 N-NOESY-HSQC (Zhang et al., 1994) 

30 4D-'H, 1J C-HMQC-NOESY-HMQC (Vuister et al, 1993) 

4D-'H, ,3 C, ,5 N-HSQC-NOESY-HSQC (Muhandiram et al, 1993; Pascal etal, 1994a) 
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within unlabeled liganc 
2D-isotope-filtered 'H-NOESY (Dcura&Bax, 1992) 

interactions between protein and ligand 
3D-isotope-filtered >H, ,5 N-NOESY-HSQC 
5 3D-isotope-filtered l H, ,3 C-NOESY-HSQC (Lee et al. 9 1994) 

5. dihedral angle constraints 

J-molulated *H, ,5 N-HSQC (Billeter etai, 1992) 

10 3D-HNHB (Archer et ai, 1991) 



D 



y 



ackbone *H, I5 N and I3 C resonance assignments for a monobody are 
comparekto those for wild-type Fn3 to assess structural changes in the mutant. 
15 Once these (lata establish that the mutant retains the global structure, structural 
refinement is performed using experimental NOE data. Because the structural 
difference of a monobody is expected to be minor, the wild-type structure can be 
used as the initial model after modifying the amino acid sequence. The mutations 
are introduced to the\vild-type structure by interactive molecular modeling, and then 
20 the structure is energy-ftiinimized using a molecular modeling program such as 
Quanta (Molecular Simulations). Solution structure is refined using cycles of 
dynamical simulated annealing (Nilges et ai 9 1988) in the program X-PLOR 
(Brunger, 1992). Typically, ^ensemble of fifty structures is calculated. The 
validity of the refined structuresis confirmed by calculating a fewer number of 
25 structures from randomly generated initial structures in X-PLOR using the YASAP 
protocol (Nilges, et ai 9 1991). StrucVe of a monobody-ligand complex is 
calculated by first refining both components individually using intramolecular 
NOEs, and then docking the two using intefcpiolecular NOEs. 

For example, the ! H, l5 N-HSQC spectrum for the fluorescein-binding 
30 monobody LB25.5 is shown in Figure 14. The spectrum shows a good dispersion 
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(peaks are spread out^aicating that LB25.5 is folded into a glWular conformation. 
Further, the spectrum resembles that for the wild-type Fn3, showing that the overall 
structure of LB25.5 is similar to that of Fn3. These results demonstrate that ligand- 
binding monobodies can be obtained without changing the global fold of the Fn3 
scaffold. 

Chemical shift perturbation experiments are performed by forming the 
complex between an isotope-labeled FnAb and an unlabeled ligand. The formation 
of a stoichiometric complex is followed by recording the HSQC spectrum. Because 
chemical shift is extremely sensitive to nuclear environment, formation of a 
complex usually results in substantial chemical shift changes for resonances of 
amino acid residues in the interface. Isotope-edited NMR experiments (2D HSQC 
and 3D CBCA(CO)NH) are used to identify the resonances that are perturbed in the 
labeled component of the complex; i.e. the monobody. Although the possibility of 
artifacts due to long-range conformational changes must always be considered, 
substantial differences for residues clustered on continuous surfaces are most likely 
to arise from direct contacts (Chen et ai, 1993; Gronenborn & Clore, 1993). 

An alternative method for mapping the interaction surface utilizes amide 
hydrogen exchange (HX) measurements. HX rates for each amide proton are 
measured for l5 N labeled monobody both free and complexed with a ligand. Ligand 
binding is expected to result in decreased amide HX rates for monobody residues in 
the interface between the two proteins, thus identifying the binding surface. HX 
rates for monobodies in the complex are measured by allowing HX to occur for a 
variable time following transfer of the complex to D 2 0; the complex is dissociated 
by lowering pH and the HSQC spectrum is recorded at low pH where amide HX is 
slow. Fn3 is stable and soluble at low pH, satisfying the prerequisite for the 
experiments. 

EXAMPLE XVI 

Construction and Analysis of Fn3-Display System Specific for Ubiquitin 
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p^^ystem was designed and synthesized,^^i 



An Fn3-dispH^ystem was designed and synthesized, uTJIquitin-binding 
clones were isolated and a major Fn3 mutant in these clones was biophysically 
characterized. 

Gene construction and phage display of Fn3 was performed as in Examples I 
5 and II above. The Fn3-phage plU fusion protein was expressed from a phagemid- 
display vector, while the other components of the M13 phage, including the wild- 
type pm, were produced using a helper phage (Bass et al., 1990). Thus, a phage 
produced by this system should contain less than one copy of Fn3 displayed on the 
surface. The surface display of Fn3 on the phage was detected by ELISA using an 
10 anti-Fn3 antibody. Only phages containing the Fn3-pm fusion vector reacted with 
the antibody. 

After confirming the phage surface to display Fn3, a phage display library of 
Fn3 was constructed as in Example HI. Random sequences were introduced in the 
BC and FG loops. In the first library, five residues (77-81) were randomized and 
15 three residues (82-84) were deleted from the FG loop. The deletion was intended to 
reduce the flexibility and improve the binding affinity of the FG loop. Five residues 
y> (26-30) were also randomized in the BC loop in order to provide a larger contact 

surface with the target molecule. Thus, the resulting library contains five 
S randomized residues in each of the BC and FG loops (Table 6). This library 

20 contained approximately 10 8 independent clones. 

fjhrarv Screening 

Library screening was performed using ubiquitin as the target molecule. In 
each round of panning, Fn3-phages were absorbed to a ubiquitin-coated surface, and 
25 bound phages were eluted competitively with soluble ubiquitin. The recovery ratio 
improved from 4.3 x 10' 7 in the second round to 4.5 x 10" 6 in the fifth round, 
suggesting an enrichment of binding clones. After five founds of panning, the 
amino acid sequences of individual clones were determined (Table 6). 



SJ 
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Table 6. 




[uences in the variegated loops of enricned clones 




Name BC loop FG loop 



Frequency 



Wild Type GCAGTTACCGTGCGT GGCCGTGGTGACAGCCCAGCGAGC 
AlaValThrValArg GlyArgGIyAspSerProAlaSer 

Library" NNKNNKNNKNNKNNK NNKNNKNNKNNKNNK 

XXXXX XXXX X (deletion) 



clonel TCGAGGTTGCGGCGG CCGCCGTGGAGGGTG 
(Ubi4) SerArgLeuArgArg ProProTrpArgVal 
clone2 GGTCAGCGAACTTTT AGGCGGTGGTGGGCT 

GlyGlnArgThrPhe ArgArgTrpTrpAla 
clone3 GCGAGGTGGACGCTT AGGCGGTGGTGGTGG 

AlaArgTrpThrLeu ArgArgTrpTrpTrp 



9 



1 N denotes an equimolar mixture of A, T, G and C; K denotes an equimolar mixture of G and T. 

A clone, dubbed Ubi4, dominated the enriched pool of Fn3 variants. Therefore, 
further investigation was focused on this Ubi4 clone. Ubi4 contains four mutations 
in the BC loop (Arg 30 in the BC loop was conserved) and five mutations and three 
deletions in the FG loop. Thus 13% (12 out of 94) of the residues were altered in 
Ubi4 from the wild-type sequence. 

Figure 15 shows a phage ELISA analysis of Ubi4. The Ubi4 phage binds to 
the target molecule, ubiquitin, with a significant affinity, while a phage displaying 
the wild-type Fn3 domain or a phase with no displayed molecules show little 
detectable binding to ubiquitin (Figure 15a). In addition, the Ubi4 phage showed a 
somewhat elevated level of background binding to the control surface lacking the 
ubiquitin coating. A competition ELISA experiments shows the IC^ (concentration 
of the free ligand which causes 50% inhibition of binding) of the binding reaction is 
approximately 5 |iM (Fig. 15b). BSA, bovine ribonuclease A and cytochrome C 
show little inhibition of the Ubi4-ubiquitin binding reaction (Figure 15c), indicating 
that the binding reaction of Ubi4 to ubiquitin does result from specific binding. 



52 



'iftitant Fn3 Protein 
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Characterization of snETutant Fn3 Protein 

The expression system yielded 50-100 mg Fn3 protein per liter culture. A 
similar level of protein expression was observed for the Ubi4 clone and other mutant 
Fn3 proteins. 

5 Ubi4-Fn3 was expressed as an independent protein. Though a majority of 

Ubi4 was expressed in E. coli as a soluble protein, its solubility was found to be 
significantly reduced as compared to that of wild-type Fn3. Ubi4 was soluble up to 
-20 \iM at low pH, with much lower solubility at neutral pH. This solubility was 
not high enough for detailed structural characterization using NMR spectroscopy or 

10 X-ray crystallography. 

The solubility of the Ubi4 protein was improved by adding a solubility tail, 
GKKGK, as a C-terminal extension. The gene for Ubi4-Fn3 was subcloned into the 
expression vector pAS45 using PCR. The C-terminal solubilization tag, GKKGK, 
was incorporated in this step. E. coli BL21 (DE3) (Novagen) was transformed with 

15 the expression vector (pAS45 and its derivatives). Cells were grown in M9 minimal 
media and M9 media supplemented with Bactotryptone (Difco) containing 
ampicillin (200 ng/ml). For isotopic labeling, l5 N NH 4 C1 replaced unlabeled NH 4 C1 
in the media. 500 ml medium in a 2 liter baffle flask was inoculated with 10 ml of 
overnight culture and agitated at 37°C IPTG was added at a final concentration of 1 

20 mM to initiate protein expression when OD (600 nm) reaches one. The cells were 
harvested by centrifugation 3 hours after the addition of IPTG and kept frozen at - 
70°C until used. 

roteins were purified as follows. Cells were suspended in 5 ml/(g cell) of 
Tris (50 mh^, pH 7.6) containing phenylmethylsulfonyl fluoride (1 mM). Hen egg 
25 lysozyme (Sigtoa) was added to a final concentration of 0.5 mg/ml. After incubating 
the solution for 3^i minutes at 37°C, it was sonicated three times for 30 seconds on 
ice. Cell debris was\emoved by centrifugation. Concentrated sodium chloride was 
added to the solution t^a final concentration of 0.5 M. The solution was applied to 
a Hi-Trap chelating columh^Pharmacia) preloaded with nickel and equilibrated in 
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i^l^ sodium chloride (0.5 M). After was^^t 



the f*is buffer contaiMg sodium chloride (0.5 M). After wasmBg the column with 
the buffehshistag-Fn3 was eluted with the buffer containing 500 mM imidazole. 
The protein w^s further purified using a Resources column (Pharmacia) with a NaCl 
gradient in a sodi^n acetate buffer (20 mM, pH 4.6). 
5 With the GKKGK tail, the solubility of the Ubi4 protein was increased to 

over 1 mM at low pH and up to -50 \xM at neutral pH. Therefore, further analyses 
were performed on Ubi4 with this C-terminal extension (hereafter referred to as 
Ubi4-K). It has been reported that the solubility of a minibody could be 
significantly improved by addition of three Lys residues at the N- or C-termini 
10 (Bianchi et al M 1994). In the case of protein Rop, a non-structured C-terminal tail is 
critical in maintaining its solubility (Smith et al., 1995). 

Oligomerization states of the Ubi4 protein were determined using a size 
exclusion column. The wild-type Fn3 protein was monomeric at low and neutral 
pH's. However, the peak of the Ubi4-K protein was significantly broader than that 
y 1 5 of wild-type Fn3, and eluted after the wild-type protein. This suggests interactions 
between Ubi4-K and the column material, precluding the use of size exclusion 
chromatography to determine the oligomerization state of Ubi4. NMR studies 
suggest that the protein is monomeric at low pH. 

The Ubi4-K protein retained a binding affinity to ubiquitin as judged by 
M= 20 ELISA (Figure 15d). However, an attempt to determine the dissociation constant 
using a biosensor (Affinity Sensors, Cambridge, U.K.) failed because of high 
background binding of Ubi4-K-Fn3 to the sensor matrix. This matrix mainly 
consists of dextran, consistent with our observation that interactions between Ubi4- 
K interacts with the cross-linked dextran of the size exclusion column. 
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Example XVII 
Stability Measurements of Monobodies 

Guanidine hydrochloride (GuHCl)-induced unfolding and refolding reactions 
were followed by measuring tryptophan fluorescence. Experiments were performed 
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on a Spectronic AB-2^ectrofluorometer equipped with a motoWfiven syringe 
(Hamilton Co.). The cuvette temperature was kept at 30°C. The spectrofluorometer 
and the syringe were controlled by a single computer using a home-built interface. 
This system automatically records a series of spectra following GuHCI titration. An 
experiment started with a 1.5 ml buffer solution containing 5 jiM protein. An 
emission spectrum (300-400 nm; excitation at 290 nm) was recorded following a 
delay (3-5 minutes) after each injection (50 or 100 pi) of a buffer solution 
containing GuHCI. These steps were repeated until the solution volume reached the 
full capacity of a cuvette (3.0 ml). Fluorescence intensities were normalized as 
ratios to the intensity at an isofluorescent point which was determined in separate 
experiments. Unfolding curves were fitted with a two-state model using a nonlinear 
least-squares routine (Santoro & Bolen, 1988). No significant differences were 
observed between experiments with delay times (between an injection and the start 
of spectrum acquisition) of 2 minutes and 10 minutes, indicating that the 
unfolding/refolding reactions reached close to an equilibrium at each concentration 
point within the delay times used. 

Conformational stability of Ubi4-K was measured using above-described 
GuHCl-induced unfolding method. The measurements were performed under two 
sets of conditions; first at pH 3.3 in the presence of 300 mM sodium chloride, where 
Ubi4-K is highly soluble, and second in TBS, which was used for library screening. 
Under both conditions, the unfolding reaction was reversible, and we detected no 
signs of aggregation or irreversible unfolding. Figure 16 shows unfolding 
transitions of Ubi4-K and wild-type Fn3 with the N-terminal (his) 6 tag and the C- 
terminal solubility tag. The stability of wild-type Fn3 was not significantly affected 
by the addition of these tags. Parameters characterizing the unfolding transitions are 
listed in Table 7. 
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Table 7. Stability paSmeters for Ubi4 and wild-type Fn3 as deflrniined by GuHCl- 
induced unfolding 



Protein 



AG 0 (kcal mol' 1 



) 



m Ci (kcal mol"' M ') 



5 



Ubi4 (pH 7.5) 
Ubi4 (pH 3.3) 
Wild-type (pH 7.5) 
Wild-type (pH 3.3) 



4.8 ±0.1 



2. 12 ±0.04 



6.5 ±0.1 



2.07 ±0.02 



7.2 ±0.2 



1.60 ±0.04 



1 1.2 dt 0.1 



2.03 ±0.02 



10 AG 0 is the free energy of unfolding in the absence of denaturant; ir^ is the 

dependence of the free energy of unfolding on GuHCl concentration. For solution 
conditions, see Figure 4 caption. 

Though the introduced mutations in the two loops certainly decreased the stability of 
1 5 Ubi4-K relative to wild-type Fn3, the stability of Ubi4 remains comparable to that of 
a "typical" globular protein. It should also be noted that the stabilities of the wild- 
type and Ubi4-K proteins were higher at pH 3.3 than at pH 7.5. 



of wild-type Fn3, but the solubility was improved by the addition of a solubility tail. 
20 Since the two mutated loops comprise the only differences between the wild-type 
and Ubi4 proteins, these loops must be the origin of the reduced solubility. At this 
point, it is not clear whether the aggregation of Ubi4-K is caused by interactions 
between the loops, or by interactions between the loops and the invariable regions of 
the Fn3 scaffold. 

25 The Ubi4-K protein retained the global fold of Fn3, showing that this 

scaffold can accommodate a large number of mutations in the two loops tested. 
Though the stability of the Ubi4-K protein is significantly lower than that of the 
wild-type Fn3 protein, the Ubi4 protein still has a conformational stability 
comparable to those for small globular proteins. The use of a highly stable domain 

30 as a scaffold is clearly advantageous for introducing mutations without affecting the 
global fold of the scaffold. In addition, the GuHCl-induced unfolding of the Ubi4 
protein is almost completely reversible. This allows the preparation of a correctly 
folded protein even when a Fn3 mutant is expressed in a misfolded form, as in 



The Ubi4 protein had a significantly reduced solubility as compared to that 



56 



p 

!i : ( 

Si 



inclusion bodies. T^feodest stability of Ubi4 in the conditioned for library 
screening indicates that Fn3 variants are folded on the phage surface. This suggests 
that a Fn3 clone is selected by its binding affinity in the folded form, not in a 
denatured form. Dickinson et al. proposed that Val 29 and Arg 30 in the BC loop 

5 stabilize Fn3. Val 29 makes contact with the hydrophobic core, and Arg 30 forms 
hydrogen bonds with Gly 52 and Val 75. In Ubi4-Fn3, Val 29 is replaced with Arg, 
while Arg 30 is conserved. The FG loop was also mutated in the library. This loop 
is flexible in the wild-type structure, and shows a large variation in length among 
human Fn3 domains (Main et al., 1992). These observations suggest that mutations 

10 in the FG loop may have less impact on stability. In addition, the N-terminal tail of 
Fn3 is adjacent to the molecular surface formed by the BC and FG loops (Figure 1 
and 17) and does not form a well-defined structure. Mutations in the N-terminal tail 
would not be expected to have strong detrimental effects on stability. Thus, residues 
in the N-terminal tail may be good sites for introducing additional mutations. 

15 

Example XVIII 
NMR Spectroscopy of Ubi4-Fn3 
Jbi4-Fn3 was dissolved in [ 2 H]-Gly HC1 buffer (20 mM, pH 3.3) containing 
NaCl (300smM) using an Amicon ultrafiltration unit. The final protein 
20 concentratiorKwas 1 mM. NMR experiments were performed on a Varian Unity 
INOVA 600 spirometer equipped with a triple-resonance probe with pulsed field 
gradient The prob^ temperature was set at 30°C. HSQC, TOCSY-HSQC and 
NOESY-HSQC spectta were recorded using published procedures (Kay et al., 1992; 
Zhang et al., 1994). NMR spectra were processed and analyzed using the NMRPipe 
25 and NMRView software (Jdfanson & Blevins, 1994; Delaglio et al., 1995) on UNIX 
workstations. Sequence-specific resonance assignments were made using standard 
procedures (Wuthrich, 1986; CIo\& Gronenborn, 1991). The assignments for 
wild-type Fn3 (Baron et al., 1992) w^e confirmed using a l5 N-labeled protein 
dissolved in sodium acetate buffer (50 m^l, pH 4.6) at 30°C. 



57 



The three-dimfflsional structure of Ubi4-K was characteffzed using this 
heteronuclear NMR spectroscopy method. A high quality spectrum could be 
collected on a I mM solution of l5 N-labeled Ubi4 (Figure 17a) at low pH. The 
linewidth of amide peaks of Ubi4-K was similar to that of wild-type Fn3, suggesting 
that Ubi4-K is monomeric under the conditions used. Complete assignments for 
backbone ! H and I5 N nuclei were achieved using standard 'H, l5 N double resonance 
techniques, except for a row of His residues in the N-terminal (His) 6 tag. There 
were a few weak peaks in the HSQC spectrum which appeared to originate from a 
minor species containing the N-terminal Met residue. Mass spectroscopy analysis 
showed that a majority of Ubi4-K does not contain the N-terminal Met residue. Fig. 
17 shows differences in l HN and l5 N chemical shifts between Ubi4-K and wild-type 
Fn3. Only small differences are observed in the chemical shifts, except for those in 
and near the mutated BC and FG loops. These results clearly indicate that Ubi4-K 
retains the global fold of Fn3, despite the extensive mutations in the two loops. A 
few residues in the N-terminal region, which is close to the two mutated loops, also 
exhibit significant chemical differences between the two proteins. An HSQC 
spectrum was also recorded on a 50 \xM sample of Ubi4-K in TBS. The spectrum 
was similar to that collected at low pH, indicating that the global conformation of 
Ubi4 is maintained between pH 7.5 and 3.3. 

The complete disclosure of all patents, patent documents and publications 
cited herein are incorporated by reference as if individually incorporated. The 
foregoing detailed description and examples have been given for clarity of 
understanding only. No unnecessary limitations are to be understood therefrom. 
The invention is not limited to the exact details shown and described for variations 
obvious to one skilled in the art will be included within the invention defined by the 
claims. 



58 



REFERENCES ^ ~ 

Alzari, P.N., Lascombe, M.-B. & Poljak, R.J. (1988) Three-dimensional structure of 

antibodies. Annu. Rev. Immunol. 6, 555-580. 
Archer, S. J., Ikura, M., Torchia, D. A. & Bax, A. (1991) An alternative 3D NMR 

technique for correlating backbone 15N with side chain Hb resonances in 

large proteins J. Magn. Reson. 95, 636-641. 
Aukhil, I., Joshi, P., Yan, Y. & Erickson, H. P. (1993) Cell- and heparin-binding 

domains of the hexabrachion arm identified by tenascin expression protein J. 

Biol. Chem. 268, 2542-2553. 
Barbas, C. F., Ill, Kang, A. S., Lerner, R. A. & Benkovic, S. J. (1991) Assembly of 

combinatorial antibody libraries on phage surfaces: the gene HI site. Proc. 

Natl. Acad. Sci. USA 88, 7978-7982. 
Barbas, C. R, IE, Bain, J. D., Hoekstra, D. M. & Lerner, R. A. (1992) Semisynthetic 

combinatorial libraries: A chemical solution to the diversity problem Proc. 

Natl. Acad. Sci. USA 89, 4457-4461. 
Baron, M., Main, A. L., Driscoll, P. C, Mardon, H. J., Boyd, J. & Campbell, I.D. 

(1992) 'H NMR assignment and secondary structure of the cell adhesion type 

II module of fibronectin Biochemistry 31, 2068-2073. 
Baron, M., Norman, D. G. & Campbell, I. D. (1991) Protein modules Trends 

Biochem. Sci. 16, 13-17. 
Bass, S., Greene, R. & Wells, J. A. (1990) Hormone phage: An enrichment method 

for variant proteins with altered binding properties Proteins: Struct. Funct. 

Genet. 5,309-314. 

Bax, A. & Grzesiek, S. (1993) Methodological advances in protein NMR. Acc. 

Chem. Res. 26, 131-138. 
Becktel, W. J. & Schellman, J. A. (1987) Protein stability curves. Biopolymer 26, 

1859-1877. 

Bhat, T. N., Bentley, G. A., Boulot, G., Greene, M. I., Tello, D., Dall'acqua, W., 

Souchon, H., Schwarz, F. P., Mariuzza, R. A. & Poljak, R. J. (1994) Bound 

59 



water molecoBr and conformational stabilization help IWaiate an antigen- 
antibody association. Proc. Natl. Acad. Sci. USA 91, 1089-1093. 
Bianchi, E., Venturing S., Pessi, A., Tramontane A. & Sollazzo, M. (1994) High 
level expression and rational mutagenesis of a designed protein, the 
minibody. From an insoluble to a soluble molecule. J. Mol. Biol. 236, 
649.659. 

Billeter, M., Neri, D., Otting, G., Qian, Y. Q. & Wiithrich, K. (1992) Precise vicinal 
coupling constants 3JHNa in proteins from nonlinear fits of J-modulated 
[ l5 N, 'H]-COSY experiments. /. Biomol. NMR 2, 251-21 A. 

Bodenhausen, G. & Ruben, D. J. (1980) Natural abundance nitrogen-15 NMR by 
enhanced heteronuclear spectroscopy. Chem. Phys. Lett. 69, 185-189. 

Bork, P. & Doolittle, R. F., PNAS 89:8990-8994 (1992). 

Bork, P. & Doolittle, R. F. (1992) Proposed acquisition of an animal protein domain 

by bacteria. Proc. Natl. Acad. Sci. USA 89, 8990-8994. 
Bork, P., Horn, L. & Sander, C. (1994) The immunoglobulin fold. Structural 

classification, sequence patterns and common core. J. Mol. Biol. 242, 309- 

320. 

Brunger, A. T. (1992) X-PLOR (Version 3.1): A system for X-ray crystallography 

and NMR., Yale Univ. Press, New Haven. 
Burke, T., Bolger, R., Checovich, W. & Lowery, R. (1996) in Phage display of 

peptides and proteins (Kay, B. K., Winter, J. and McCafferty, J., Ed.) Vol. 

pp305-326, Academic Press, San Diego. 
Campbell, I. D. & Spitzfaden, C. (1994) Building proteins with fibronectin type m 

modules Structure 2, 233-337. 
Chen, Y., Reizer, J., Saier, M. H., Fairbrother, W. J. & Wright, P. E. (1993) 

Mapping the binding interfaces of the proteins of the bacterial 

phaphotransferase system, HPr and IIAglc. Biochemistry 32, 32-37. 
Clackson & Wells, (1994) Trends Biotechnology 12, 173-184. 



60 



rn, A. M. (1991) Structure of larger prWetns in solution: 
Three- and four-dimensional heteronuclear NMR spectroscopy. Science 252, 
1390-1399. 

Corey, D. R., Shiau, A. K., Q., Y., Janowski, B. A. & Craik, C. S. (1993) Trypsin 

display on the surface of bacteriophage. Gene 128, 129-134. 
Davies, J. & Riechmann, L. (1996). Single antibody domains as small recognition 

units: design and in vitro antigen selection of camelized, human VH 

domains with improved protein stability. Protein Eng. . 2(6), 531-537. 
Davies, J. & Riechmann, L. (1995) Antobody VH domains as small recognition 

units. Bio/Technol. 13, 475-479. 
Delaglio, F., Grzesiek, S., Vuister, G. W., Zhu, G., Pfeifer, J. & Bax, A. (1995) 

NMRPipe: a multidimensional spectral processing system based on UNIX 

pipes. J. Biomol. NMR 6, 277-293. 
Deng, W. P. & Nickoloff, J. A. (1992) Site-directed mutagenesis of virtually any 

plasmid by eliminating a unique site. Anal. Biochem. 200, 81-88. 
deVos, A. M, Ultsch, M. & Kossiakoff, A. A. (1992) Human Growth hormone and 

extracellular domain of its receptor: crystal structure of the complex. Science 

255, 306-312. 

Dickinson, C. D., Veerapandian, B., Dai, X.P., Hamlin, R. C, Xuong, N.-H., 

Ruoslahti, E. & Ely, K. R. (1994) Crystal structure of the tenth type m cell 
adhesion module of human fibronectin /. Mol. Biol. 236, 1079-1092. 

Djavadi-Ohaniance, L., Goldberg, M. E. & Friguet, B. (1996) in Antibody 

Engineering. A Practical Approach (McCafferty, J., Hoogenboom, H. R. 
and Chiswell, D. J., Ed.) Vol. pp77-97, Oxford Univ. Press, Oxford. 

Dougall, W. C, Peterson, N. C. & Greene, M. I. (1994) Antibody-structure-based 
design of pharmacological agnets. Trends Biotechnol. 12, 372-379. 

Garrett, D. S., Powers, R., Gronenborn, A. M. & Clore, G. M. (1991) A common 

sense approach to peak picking in two-, three- and four-dimensional spectra 



61 



using automawcomputer analysis of contour diagrarri^^Magn. Reson. 95, 
214-220. 

Ghosh, G., Van Duyne, G., Ghosh, S. & Sigler, P. B. (1995) Structure of NF-kB p50 

homodimer bound to a kB site. Nature 373, 303-310. 
Gribskov, M., Devereux, J. & Burgess, R. R. (1984) The codon preference plot: 

graphic analysis of protein coding sequences and prediction of gene 

expression. Nuc. Acids. Res. 12, 539-549. 
Groneborn, A. M, Filpula, D. R., Essig, N. Z., Achari, A., Whitlow, M., Wingfield, 

P. T. & Clore, G. M. (1991) A novel, highly stabel fold of the 

immunoglobulin binding domain of Streptococcal protein G. Science 253, 

657-661. 

Gronenborn, A. M. & Clore, G. M. (1993) Identification of the contact surface of a 

Streptococcal protein G domain complexed with a human Fc fragment. J. 

Mol. Biol. 233, 331-335. 
Grzesiek, S., Anglister, J. & Bax, A. (1993) Correlation of backbone amide and 

aliphatic side-chain resonances in 13C/15N-enriched proteins by isotropic 

mixing of 13C magnetization. J. Magn. Reson. B 101, 1 14-1 19. 
Grzesiek, S. & Bax, A. (1992) Correlating backbone amide and side chain 

resonances in larger proteins by multiple relayed triple resonance NMR. J. 
s Am. Chem. Soc. 114, 6291-6293. 
Grzesiek, S. & Bax, A. (1993) Amino acid type determination in the sequential 

assignment procedure of unformly 13C/15N-enriched proteins. J. Biomol. 

NMR 3, 185-204. 

Harlow, E. & Lane, D. (1988) Antibodies. A laboratory manual, Cold Spring 

Harbor Laboratory, Cold Spring Harbor. 
Harpez, Y. & Chothia, C. (1994) Many of the immunoglobulin superfamily domains 

in cell adhesioin molecules and surface receptors belong to a new structural 

set which is close to that containing variable domains J. Mol. Biol. 238, 528- 

539. 



62 



Hawkins, R. E., RussSHrS. J., Baier, M. & Winter, G. (1 193) TrFcontnbution of 
contact and non-contact residues of antibody in the affinity of binding to 
antigen. The interaction of mutant D 1.3 antibodies with lysozyme. /. Mol. 
Biol. 234, 958-964. 

Holliger, P. et al., (1993) Proc. Natl. Acad. Sci. 90, 6444-6448. 

Hu, S-z., et al., Cancer Res. 56:3055-3061 (1996). 

Dcura, M. & Bax, A. (1992) Isotope-filtered 2D NMR of a protein-peptide complex: 

study of a skeletal muscle myosin light chain kinase fragment bound to 

calmodulin. J. Am. Chem. Soc. 114, 2433-2440. 
Dcura, M, Kay, L. E. & Bax, A. (1991) Improved three-dimensional 1H-13C-1H 

correlation spectroscopy of a 13C-labeled protein using constant-time 

evolution. J. Biomol. NMR 1, 299-304. 
Jacobs, J. & Schultz, P. G. (1987) Catalytic antibodies. J. Am. Chem. Soc. 109, 

2174-2176. 

Janda, K. D., et al., Science 275:945-948 (1997). 

Jones, E. Y. (1993) The immunoglobulin superfamily Curr. Opinion struct. Biol. 3, 
846-852. 

Jones, P. T., Dear, P. H., Foote, J., Neuberger, M. S. & Winter, G. (1986) Replacing 

the complementarity-determining regions in a human antibody with those 

from a mouse Nature 321, 522-525. 
Kabsch, W. & Sander, C. (1983) Dictionary of protein secondary structure: pattern 

recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 

2577-2637. 

Kay, L. E. (1995) Field gradient techniques in NMR spectroscopy. Curr. Opinion 

Struct. Biol. 5,674-681. 
Kay, L. E„ Keifer, P. & Saarinen, T. (1992) Pure absorption gradient enhanced 

heteronuclear single quantum correlation spectroscopy with improved 

sensitivity 7. Am. Chem. Soc. 114, 10663-10665. 



63 



Kay, L. E., Xu, G.-Y^Singer, A. U. (1993) A Gradient-Enhftd HCCH-TOCSY 
Experiment for Recording Side-Chain 1H and 13C Correlations in H20 
Samples of Proteins. J. Magn. Reson. B 101, 333-337. 
Koide, S., Dyson, H. J. & Wright, P. E. (1993) Characterization of a folding 
intermediate of apoplastcyanin trapped by proline isomerization. 
Biochemistry 32, 12299-12310. 
Kornblihtt, A. R., Umezawa, K., Vibe-Pederson, K. & Baralle, F. E. (1985) Primary 
structure of human fibronectin: differential splicing may generate at least 10 
polypeptides from a single gene EMBO J. 4, 1755-1759. 
Kraulis, P. (1991) MOLSCRIPT: a program to produce both detailed and scnematic 

plots of protein structures. J. Appl. Cryst. 24, 946-950. 
Kunkel, T. A. (1985) Rapid and efficient site-specific mutagenesis without 

phenotypic selection. Proc. Natl. Acad. Sci. USA 82, 488-492. 
Leahy, D. J., Hendrickson, W. A., Aukhil, I. & Erickson, H. P. (1992) Structure of a 
fibronectin type HI domain from tenascin phased by MAD analysis of the 
selenomethionlyl protein Science 258, 987-991. 
Lee, W., Revington, M. J., Arrowsmith, C. & Kay, L. E. (1994) A pulsed field 

gradient isotope-filtered 3D 13C HMQC-NOESY experiment for extracting 
intermolecular NOE contacts in molecular complexes. FEBS lett. 350, 87-90. 
Lerner, R. A. & Barbas HI, C. F., Acta Chemica Scandinavica, 50 672-678 (1996). 
Lesk, A. M. & Tramontane A. (1992) Antibody structure and structural predictions 
useful in guiding antibody engineering. In Antibody engineering. A practical 
guide. (Borrebaeck, C. A. K., Ed.) Vol. W. H. Freeman & Co., New York. 
Li, B., Tom, J. Y., Oare, D., Yen, R., Fairbrother, W. J., Wells, J. A. & 

Cunningham, B. C. (1995) Minimization of a polypeptide hormone Science 
270, 1657-1660. 

Litvinovich, S. V., Novokhatny, V. V., Brew, S. A. & Inhgam, K. C. (1992) 

Reversible unfolding of an isolated heparin and DNA binding fragment, the 
first type HI module from fibronectin. Biochim. Biophys. Acta 1119, 57-62. 



64 



Logan, T. M., Olejn Jfc E. T., Xu, R. X. & Fesik, S. W. (199^fede chain and 
backbone assignments in isotopically labeled proteins from two 
heteronuclear triple resonance experiments. FEBS lett. 314, 413-418. 

Main, A. L., Harvey, T. S., Baron, M., Boyd, J. & Campbell, I. D. (1992) The three- 
dimensional structure of the tenth type HI module of fibronectin: an insight 
into RGD-mediated interactions. Cell 71, 671-678. 

Masat, L., et al., (1994) PNAS 91:893-896. 

Martin, F., Toniatti, C, Ciliberto, G., Cortese, R. & Sollazzo, M. (1994) The 

affinity-selection of a minibody polypeptide inhibitor of human interleukin- 

6. EMBO J. 13, 5303-5309. 
Martin, M. T., Drug Discov. Today, 1:239-247 (1996). 
McCafferty, J., Griffiths, A. D., Winter, G. & Chiswell, D. J. (1990) Phage 

antibodies: filamentous phage displaying antibody variable domains. Nature 

348, 552-554. 

McConnell, S. J., & Hoess, R. H., J.Mol. Biol. 250:460-470 (1995). 

Metzler, W. J., Leiting, B., Pryor, K., Mueller, L. & Farmer, B. T. I. (1996) The 

three-dimensional solution structure of the SH2 domain from p55blk kinase. 

Biochemistry 35, 6201-621 1. 
Minor, D. L. J. & Kim, P. S. (1994) Measurement of the P-sheet-forming 

propensities of amino acids. Nature 367, 660-663. 
Muhandiram, D. R., Xu, G. Y. & Kay, L. E. (1993) An enhanced-sensitivity pure 

absorption gradient 4D 15N,13C-edited NOESY experiment. J. Biomol. 

NMR 3, 463-470. 

Miiller, C. W., Rey, F. A., Sodeoka, M., Verdine, G. L. & Harrison, S. C. (1995) 
Structure of the NH-kB p50 homodimer boound to DNA. Nature 373, 311- 
117. 

Nilges, M., Clore, G. M. & Gronenborn, A. M. (1988) Determination of three- 
dimensional structures of proteins from interproton distance data by hybrid 



65 



distance geonSEry-dynamical simulated annealing calcorations. FEBS lett. 
229,317-324. 

Nilges, M., Kuszewski, J. & Briinger, A. T. (1991) in Computational aspects of the 
study of biological macromolecules by nuclear magnetic resonance 
spectroscopy. (Hoch, J. C, Poulsen, F. M. and Redfield, C, Ed.) Vol. pp. 
451-455, Plenum Press, New York. 
O'Neil et al., (1994) in Techniques in Protein Chemistry V(Crabb, L., ed.) pp.5 17- 

524, Academic Press, San Diego. 
O'Neil, K. T. & Hoess, R. H. (1995) Phage display: protein engineering by directed 

evolution. Curr. Opinion Struct. Biol. 5, 443-449. 
Pace, C. N., Shirley, B. A. & Thomson, J. A. (1989) Measuring the conformational 
stability of a protein. In Protein structure. A practical approach (Creighton, 
T. E., Ed.) Vol. pp 31 1-330, IRL Press, Oxford. 
Parmley, S. F. & Smith, G. P. (1988) Antibody-selectable filamentous fd phage 

vectors: affinity purification of target genes Gene 73, 305-318. 
Pascal, S. M., Muhandiram, D. R., Yamazaki, T., Forman-Kay, J. D. & Kay, L. E. 
(1994a) Simultaneous acquisition of 15N- and 13C-edited NOE spectra of 
proteins dissolved in H20. J. Magn. Reson. B 103, 197-201. 
Pascal, S. M., Singer, A. U., Gish, G., Yamazaki, T., Shoelson, S. E., Pawson, T., 
Kay, L. E. & Forman-Kay, J. D. (1994b) Nuclear magnetic resonance 
structure of an SH2 domain of phospholipase C-gl complexed with a high 
affinity binding peptide. Cell 77, 461-472. 
Pessi, A., Bianchi, E., Crameri, A., Venturini, S., Tramontano, A. & Sollazzo, M. 
(1993) A designed metal-binding protein with a novel fold. Nature 362, 
3678-369. 

Rees, A. R., Staunton, D., Webster, D. M., Searle, S. J., Henry, A. H. & Pedersen, J. 
T. (1994) Antibody design: beyond the natural limits. Trends Biotechnol. 12, 
199-206. 

Roberts et al., (1992) Proc. Natl. Acad. Sci. USA 89, 2429-2433. 



66 



Rosenblum, J. S. & BaSISs, C. F. I. (1995) in Antobody Engineeringym>rvenbatck, C. A. K M 

Ed.) Vol. pp89-l 16, Oxford University Press, Oxford. 
Sambrook, J., Fritsch, E. F. & Maniatis, T. (1989) Molecular Cloning: A laboratory manual, 

2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor. 
Sandhu, G. S., Aleff, R. A. & Kline, B. C. (1992) Dual asymmetric PCR: one-step 

construction of synthetic genes. BioTech. 12, 14-16. 
Santoro, M. M. & Bolen, D. W. (1988) Unfolding free energy changes determined 
by the linear extrapolation method. 1. Unfolding of phenylmethanesulfonyl 
a-chymotrypsin using different denaturants Biochemistry 27, 8063-8068. 
Smith, G. P. & Scott, J. K. (1993) Libraries of peptides and proteins displayed on 

filamentous phage. Methods Enzymol. 217, 228-257. 
Smith, G. P. (1985) Filamentous fusion phage: novel expression vectors that display 

cloned antigens on the virion surface. Science 228, 1315-1317. 
Smith, C. K., Munson, M. & Regan, L. (1995). Studying cc-helix and p-sheet 

formation in small proteins. Techniques Prot. Chem.. 6, 323-332. 
Smith, C. K., Withka, J. M. & Regan, L. (1994) A thermodynamic scale for the b-sheet 

forming tendencies of the amino acids. Biochemistry 33, 5510-5517. 
Smyth, M. L. & von Itzstein, M. (1994) Design and synthesis of a biologically active 
antibody mimic based on an antibody-antigen crystal structure. 7. Am. Chem. 
Soc. 116, 2725-2733. 
Studier, F. W., Rosenberg, A. H., Dunn, J. J. & Dubendorff, J. W. (1990) Use of T7 
RNA polymerase to direct expression of cloned genes Methods Enzymol 
185, 60-89. 

Suzuki, H. (1994) Recent advances in abzyme studies. /. Biochem. 115, 623-628. 
Tello, D., Goldbaum, F. A., Mariuzza, R. A., Ysern, X., Schwarz, F. P. & Poljak, R. 

J. (1993) Immunoglobulin superfamily interactions. Biochem. Soc. Trans. 

21, 943-946. 

Thomas, N. R. (1994) Hapten design for the generation of catalytic antibodies. 
Appl. Biochem. Biotech. 47, 345-372. 

67 





alpha- and beWcarbon resonances in proteins J. Magn^eson. B 101, 201-205. 
Wu, T. T., Johnson, G. & Kabat, E. A. (1993) Length distribution of CDRH3 in 

antibodies Proteins: Struct. Fund. Genet. 16, 1-7. 
Wuthrich, K. (1986) NMR of proteins and nucleic acids, John Wiley & Sons, New 
York. 

Yamazaki, T., Forman-Kay, J. D. & Kay, L. E. (1993) Two-Dimensional NMR 

Experiments for Correlating 13C-beta and lH-delta/epsilon Chemical Shifts 
of Aromatic Residues in 13C-Labeled Proteins via Scalar Couplings. J. Am. 
Chem. Soc. 115, 1 1054. 

Zhang, O., Kay, L. E., Olivier, J. P. & Forman-Kay, J. D. (1994) Backbone 1H and 
15N resonance assignments of the N-terminal SH3 domain of drk in folded 
and unfolded states using enhanced-sensitivity pulsed field gradient NMR 
techniques. J. Biomol. NMR 4, 845-858. 




69 



